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HUMAN TELOMERASE 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAG 60 
MetProArgAlaProArgCysArgAlaValArgSerLeuLeuArgSerHisTyrArgGlu 20 



GTGCTGCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAG 120 
ValLeuProLeuAlaThrPheValArgArgLeuGlyProGlnGlyTrpArgLeuValGln 40 



CGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGG 180 
ArgGlyAspProAlaAlaPheArgAlaLeuValAlaGlnCysLeuValCysValProTrp 60 



GACGCACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTG 240 

AspAlaArgProProProAlaAlaProSerPheArgGlnValSerCysLeuLysGluLeu 80 

GTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGC 300 

ValAlaArgValLeuGlnArgLeuCysGluArgGlyAlaLysAsnValLeuAlaPheGly 100 

TTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGC 360 

PheAlaLeuLeuAspGlyAlaArgGlyGlyProProGluAlaPheThrThrSerValArg 120 

AGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTG 420 

SerTyrLeuProAsnThrValThrAspAlaLeuArgGlySerGlyAlaTrpGlyLeuLeu 140 



TTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTG 480 
LeuArgArgValGlyAspAspValLeuValHisLeuLeuAlaArgCysAlaLeuPheVal 160 



CTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCT 540 
LeuValAlaProSerCysAlaTyrGlnValCysGlyProProLeuTyrGlnLeuGlyAla 180 



GCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAA 600 
AlaThrGlnAlaArgProProProHisAlaSerGlyProArgArgArgLeuGlyCysGlu 200 



CGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGT 660 
ArgAlaTrpAsnHisSerValArgGluAlaGlyValProLeuGlyLeuProAlaProGly 220 



GCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGT 720 jp' 4 A 

AlaArgArgArgGlyGlySerAlaSerArgSerLeuProLeuProLysArgProArgArg 240 




GGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGC . 780 
GlyAlaAlaProGluProGluArgThrProValGlyGlnGlySerTrpAlaHisProGly 260. 



AGGACGCGTGGACCGAGTGAGCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAA 840 
ArgThrArgQlyProSorAspArgaiyPheCyeValValSerProAlaAraProAlaaiu 380 



GAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGC 900 
GluAlaThrSerLeuGluGlyAlaLeuSerGlyThrArgHisSerHisProSerValGly 300 



CGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCCTGGGACACGCCT 960 
ArgGlnHisHisAlaGlyProProSerThrSerArgProProArgProTrpAspThrPro 320 



TGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAG 1 020 
CysProProValTyrAlaGluThrLysHisPheLeuTyrSerSerGlyAspLysGluGln 340 



CTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTC 1080 

LeuArgProSerPheLeuLeuSerSerLeuArgProSerLeuThrGlyAlaArgArgLeu 360 

GTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCC 1 1 40 

ValGluThrllePheLeuGlySerArgProTrpMetProGlyThrProArgArgLeuPro 380 

CGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCAC 1 200 

ArgLeuProGlnArgTyrTrpGlnMetArgProLeuPheLeuGluLeuLeuGlyAsnHis 400 



GCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACC 1260 
AlaGlnCysProTyrGlyValLeuLeuLysThrHisCysProLeuArgAlaAlaValThr 420 



CCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAG 1 320 

ProAlaAlaGlyValCysAlaArgGluLysProGlnGlySerValAlaAlaProGluGlu 440 

GAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAG 1380 

GluAspThrAspProArgArgLeuValGlnLeuLeuArgGlnHisSerSerProTrpGln 460 

GTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCC 1440 

ValTyrGlyPheValArgAlaCysLeuArgArgLeuValProProGlyLeuTrpGlySer 480 

AGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCAT 1500 Fig. IB 

ArgHisAsnGluArgArgPheLeuArgAsnThrLysLysPhelleSerLeuGlyLysHis 500 




GCCAAGCTCTCGCTGCAGGAGCTGACGTG6AAGATGAGCGTGCGGGGCTGCGCTTGGCTG .1560 
AlaLysLeuSerLeuGlnGluLeuThrTrpLysMetSerValArgAspCysAlaTrpLeu 520 



CGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATC 1 620 

ArgArgSerProGlyValGlyCysValProAlaAlaGluHisArgLeuArgGluGluIle 540 

CTGGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTC 1 680 

LeuAlaLysPheLeuHisTrpLeuMetSerValTyrValValGluLeuLeuArgSerPhe 560 



TTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTC 1 740 
PheTyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerVal 580 



TGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAG 1 800 
TrpSerLysLeuGlnSerlleGlylleArgGlnHisLeuLysArgValGlnLeuArgGlu 600 



CTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGA 1860 

LeuSerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArg 620 

CTCCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTG 1 920 

LeuArgPhelleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValVal 640 



GGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCA 1 980 
GlyAlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAla 660 



CTGTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTG 2040 
LeuPheSerValLeuAsnTyrGIuArgAlaArgArgProGlyLeuLeuGlyAlaSerVal 680 



CTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAG 2100 
LeuGlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGln 700 



GACCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATC 2160 

AspProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrlle 720 

CCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGC 2220 

ProGlnAspArgLeuThrGluVallleAlaSerllelleLysProGlnAsnThrTyrCys 740 



GTGCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAG 2280 
ValArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLys 760 
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AGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 2340 

SerHisValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAlaHisLeu 780 

CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAG 2400 

GlnGluThrSerProLeuArgAspAlaValVallleGluGlnSerSerSerLeuAsnGlu 800 

GCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 2460 

AlaSerSerGlyLeuPheAspValPheLeuArgPheMetCysHisHisAlaValArglle 820 

AGGGGCAAGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTG 2520 

ArgGlyLysSerTyrValGlnCysGlnGlylleProGlnGlySerlleLeuSerThrLeu 840 

CTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGAC 2580 

LeuCysSerLeuCysTyrGlyAspMetGluAsnLysLeuPheAlaGlylleArgArgAsp 860 

GGGCTGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCG 2640 

GlyLeuLeuLeuArgLeuValAspAspPheLeuLeuValThrProHisLeuThrHisAla 880 

AAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTG 2700 

LysThrPheLeuArgThrLeuValArgGlyValProGluTyrGlyCysValValAsnLeu 900 



CGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTT 2760 

ArgLysThrValValAsnPheProValGluAspGluAlaLeuGlyGlyThrAlaPheVal 920 

CAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTG 2820 

GlnMetProAlaHisGlyLeuPheProTrpCysGlyLeuLeuLeuAspThrArgThrLeu 940 

GAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTC 2880 

GluValGlnSerAspTyrSerSerTyrAlaArgThrSerlleArgAlaSerLeuThrPhe 960 



AACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTG 2940 

AsnArgGlyPheLysAlaGlyArgAsnMetArgArgLysLeuPheGlyValLeuArgLeu 980 

AAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAAC 3000 

LysCysHisSerLeuPheLeuAspLeuGlnValAsnSerLeuGlnThrValCysThrAsn 1000 

ATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCA 3060 

IleTyrLysIleLeuLeuLeuGlnAlaTyrArgPheHisAlaCysValLeuGlnLeuPro 1020 



TTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCC 3120 
PheHisGlnGlnValTrpLysAsnProThrPhePheLeuArgVallleSerAspThrAls 1040 



TCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGC 3180 
SerLeuCysTyrSerlleLeuLysAlaLysAsnAlaGlyMetSerLeuGlyAlaLysGly 1060 



GCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTC 3240 
AlaAlaGlyProLeuProSerGluAlaValGlnTrpLeuCysHisGlnAlaPheLeuLeu 1080 



AAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAG 3300 
LysLeuThrArgHisArgValThrTyrValProLeuLeuGlySerLeuArgThrAlaGln 1 1 00 



ACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAAC 3360 
ThrGlnLeuSerArgLysLeuProGlyThrThrLeuThrAlaLeuGluAlaAlaAlaAsn 1 1 20 



CCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACtgatggccacccgcccacagccag 3420 

ProAlaLeuProSerAspPheLysThrlleLeuAsp 1132 

Gccgagagcagacaccagcagccctgtcacgccgggctctacgtcccagggagggagggg 3480 

Cggcccacacccaggcccgcaccgctgggagtctgaggcctgagtgagtgtttggccgag 3540 

gcctgcatgtccggctgaaggctgagtgtccggctgaggcctgagcgagtgtccagccaa 3600 

gggctgagtgtccagcacacctgccgtcttcacttccccacaggctggcgctcggctcca 3660 

ccccagggccagcttttcctcaccaggagcccggcttccactccccacataggaatagtc 3720 

catccccagattcgccattgttcacccctcgccctgccctcctttgccttccacccccac 3780 

eatccaggtggagaccctgagaaggaccctgggagctctgggaatttggagtgaccaaag 3840 

gtgtgccctgtacacaggcgaggaccctgcacctggatgggggtccctgtgggtcaaatt 3900 

ggggggaggtgctgtgggagtaaaatactgaatatatgagtttttcagttttgaaaaaaa 3960 

aaaa 3964 



Fig. IE 



Euolotes 

HT1 

EST2 



1 ME VD /DNQADNHG I HS4 LKTC EE I YSW/QK V I RC/? - -NQSQShTKDLEOIs 

1 RRLGPQGWRLV0RGDPAAFRALVAQCL/CVPWDAR-PPR4APSFRQVSCUfEiVARVLQRLCE/?GAKNVLAFGFALZ.DGA 
1 MKlLFiF IQQKLQiQ ■ ■iQTHS'YK&lLKZQ 



Euplotes 56 IFAQTNIVArPRDYNEEDFKVMRK EVFSTGLMI£/./DKCLVE/./.SSSDVSDRCKLCCFGFCLvGWC-LAn 

HT1 80 RGGPPEAFT7SVRSYLFAn"VT[MLRGSGAWGLLLRRVGD0VLVHi.LARCALFVLVAPSCAY---QVCGFPLYGLGMTQA 
EST2 30 HFNGLDEIL7T-CFAZ./WSRKL4Z.P - - - - CLPfiDLSHKAV/OHCI IYLiTGELYNN— VLTF6YKIAIWE0VNK 



Euplotes 126 THLLX4LSTQKQYFFQDEKWQVRAMIGNELFRHLYTKaiF(?RTSEGTLVC>FCG/VNVFDHLKVNDKF0KKQKGGAA0MNE 
HT1 157 RPPPtt4SGPRRRiGCEFMWWHSVREAGVPiGLPAPGARRRGGSASRSLPLPKRPR/?GAAPEPERTPt/G0GSWAHPGRTRG 
EST2 97 SLFCWSANVNVT/.LKGA4WKMFHSLVGTYAFV0/.LIN)'TVI(?FNGQ-FFTOIVGA//?CNEFHLPPKWl/QRSSSS 



Euplotes 206 PRCCSTCKYNVKNEKDHFUN I NVFNWNNMKSRTRi FKCTHFAfRNNOFF 

HT1 237 PSDRGFCVVSPARPAfEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAET/f HF£ XSSGDK- - EOLR 
EST2 169 SATAAQI/f QL TfPV f/V - KQFLHKLNW-SSSFF 



Euplotes 255 KKHEFV5NKNN/SAM -DRAQ77FTN I F/?FN/?JRKK/./CDKVIEKIAYM/.EKV/(DFA/FNYYLTKSCF/.PfNWRE 

HT1 315 PSFLI^RPSLTGARWVF7JFLGSRPWMPGTPRRtP/?LWY - WQMRPLFLEtiGNWAQCP yGVL±KTHCPZ.RAAVTP 

EST2 200 PYSKILPSSSSJKKLTOLRFA/FP TNtVK7P0/?/./CVRINLTLQKL/.KR/y/(RLA/yVSI/.NSICPPLfGT- - 

Telomerase domain 

Euplotes 326 RK Q/fIENLINKT/?ffKS--KYYEE/.FSYTTDNKCn"QF/NEFFYNI/.FKDFLTGy?-/VRKNFQ/CKV^nVE/.N/CWE 

HT1 394 MGVCARE/CPQGSVMPEfFDTDPRRLVQaRQHSSPWQl'YGFVP^Ci.RRiVFPGLWGS/?H«ERRFLRyVT/C/(FIS/.Gm 

EST2 268 VLD/.SHLSKQ SPKERl'LKF/IVI/.QK/./.FQEMFGSKKA/KGKII^AfLNLLLS/.PLNG 



Eupl otes 398 LIHKNLaE/riNTREISWMQVET-SAKHFYYFDHE/VIYVIW/CL/.RWIFED/.KI/SL/RCFFl'/Tf QQKSYSKTYy//?i(NIW 

HT1 474 KLSLQE/.TWMSV/MCAM.RR5PGVGCVPMEHRLREEI/.^ 

EST2 324 YLPFDS/./.K/CLRLKPFMFISD-IWFTKHNFENiAf-QUICFISyi.FRQUPKIZQTFFKCrflS-STVTIV/FfflDW 

Motif 1 Motif2 

Euplotes 477 0VIMKMSJADLK/C-ETLA£VQFKfl/EEWKKSL-GFAPG/(/./?L7F/(m--F/?F/MTFNKKIVNSDRK--™LnNTKLL 

HT1 554 SmjSIGIRQHLKRVQiRf LSf Af l/RQHREARPALLTSR/.KF7W(PDG- - LRP7VNMDYVVGARTFRREKRAF RLTSRVK 

EST2 401 IWfL ITPF7VEYF/C - TYZ.VE NNVCRNHNSYTLS - NFNHS/CMffl JP/flf SNNEfRI /AI PCRGADEEEFT - - 1 Y/CFNHKNAIQ 
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Euplotes 551 NSHlMiKT/.KAf-/?MFKDPFGFAyFNY0OVM^ 

HTl 632 ALFSV/.N/E/?AR/?- • PGLLSAS I'LGLDD I HRAWRTF ^RVRAQDPPPf/.KfV/fVDl'TGAKDTJPQDR/.TEVI AS i IKPQN 
EST2 477 PTQKUE//./?A^PTSFTKIYSPTQIAORI/(EFKQRL^KFNNVLP-aK/WFDl'KSCraSJP/?MECMRI^DAL^NtW 



Euplotes 629 O/^IMTAQILKRKWNIl'IDSKWFRKKEMKDYFRQKFQKIALEGGQYPTLFSVLENEiJNDLNAKKTLIVFAK-CRW'f^:-;: 1 

HTl 710 TYCKRRYAVVQKAAHGHVRKAFKSHVS TLTDLQPYMRQFVAHL0E7"SPiR0AVWfQSSSUVEASSG 

EST2 556 GFF^/?SQYFFN-T//TG^LKLF/VVVN A--SRVPKPYELYI0MVR7VHLSNQDWNVV-EMEI«T- 

Motif B Motif C 

Euplotes 708 NLLQP^INICQYNYINFN^FyKOTO/WLCVSSI/.SSFYyAT/.ffSSLGFLRDESMNPENPNVNaW/.TDDY/././T 

HTl 777 LFOVFLRFMCHHAVRIR-fi/(S)'V(?CQ(JJWSI/.STU.CSLC)'GDMfN- - -KLF/1GIRRD GLLLRLWDFLLVT 

EST2 616 --ALWI/EDKCYIR EDSLFOSSSLSAPIVDLVyDDLLfFYSEFMSPSQD UlLUWDFLUS 

Motif D Motif E 

Euplotes 788 7"QENN4 V IFIEKL I N VSRFNGFKFWMKKLQTSFP/.SP5KFAKYGMD5VEEQN I /QDYCOWIG i S I DMK7LALMPN IA/LR I 
HTl 847 PHLTH4KTFLRT/.VRGVPfYGCVVAfLR^TVVNFPVEDEALGG-TAFV(?MPAHGLFPWCGLLL0TRT/.fl'Q$DYSSYAR-- 
EST2 677 TDQQ-Q^INIKJKLAMG- - - - GFQK YNAKANRDK I LAVS SQSDDDTt/IQFCAMHIFVKE/.f WKHSSTMW- - - 



Euplotes 868 EGILCTLA/LWMQTKKASMWLKKKLKSFLMNNIffl^ 

HTl 924 TSJRAS/.TFA/RGFACAGRNMRR/CLFGVZ.RLKCHSLFLDLQVNSLQTVCTNIY/CILLLQAKRFHACVLQLPFHCOVW/CA/PFF 
EST2 741 A/FHIRS/(SS----/(GIFRS/.IALFNrRISYmorNLNSTNrVLMQIDHVVKNISECY^SA--FKOLSIWVrQ 



Euplotes 946 SSMIDLEVSKJ/ySVTR4FFKYLVCNI/0?TIFGEfHYPDFF/.ST/.KHFIf IFSTKKYIFNRVCMJ/.KAKFAK/.KSDQCQS 
HTl 1004 FLRVISDTASLCKSILMKNAGMSLGA/CGAAGPLPSEAVQW/.C-HQAF/.LK/.TRHRVTYVPLLGS/.RTAQTaSRKLPGT 
EST2 808 NMQFH5FLQRJJEMTVSG CPITKC£>PLIEYfVR--FTI--iNGF/.fS/.SSNTSKF-K0NI7/.LRXFI(?HLQAYIYI 



Euplotes 1026 L7QYDA 

HTl 1083 TLTALEAAANPAIPSDFKTILD 
EST2 879 YJHIVN-- 
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j|j ^Alternative intron(s)/exon(s) deleted 

[§ Alternative intron(s)/exon(s) inserted or unspliced intron 
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1950 1952 

1 5 ' - AAAGAGG | GTGGCTG AACAGAA | GCCGAGC - 3 ' 

2130 2167 
a 5 ' -TGTCAAG | gtggatg cccccag| GACAGGC - 3 ' 

2286 2468 
b 5'-GAGCCAC|gtctcta ggggcaa | GTCCTAC-3 ' 

2843 2844 

2 5 ' -ACTCCAG | GTGAGCG XXXXXXX | CTATGCC - 3 ' 

3157 

3 5 ' -AACGCAG | CCGAAGAAAACATTTCTGTCGTGACTCCTGCGGTGCTTGGGTCGGGACAGCCAGAGATGG 

T A A EENILVVTPAVLGSGQPEME 

AGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTG 
PPRRPSGVGSFPVSPGRGVG 

3158 

GGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG | GGATGTC - 3 ' 
L G L * 
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-400 
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GACGTGGAAGATGAGC6T6CGG6ACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCT6GCCAAGTTCCTGCACTGGCT 
TWKMSVROCAWLRRSPGVGCVPAAEHRLREE I LAKFLHWL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRIFFYRKSVWSKLQSIG 

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFiPKPD 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V H H 0 Y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L 0 0 i H R A H R T F V L R V R A Q 0 P P P E L Y F 

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
V K V 0 V T G A Y D T 1 P Q 0 R L T E V I A S I I K P Q N T Y C V R R Y A V V Q 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCAC 
KAAHGHVRKAFKSH 

GTCCTACGTCCAGTG 
V L R P V 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
P G 0 P A G L H P L H A A L Q P V L R R H G E Q A V C G D S A G R A A P A F G G 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 



Fig. 11AA 



Reference protein (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAG 60 
MetProArgAlaProArgCysArgAlaValArgSerLeuLeuArgSerHisTyrArgGlu 20 

GTGCTGCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAG 120 
Val LeuProLeuAl aThrPheVal ArgArgLeuGlyProGl nGlyTrpArgLeuVal Gl n 40 

CGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGG 180 
ArgGlyAspProAlaAlaPheArgAlaLeuValAl aGl nCysLeuVal CysVal ProTrp 60 

GACGCACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTG 
AspAlaArgProProProAlaAlaProSerPheArgGlnVal 

GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAG 
GLPGVGVRLGLRAAGGNQRHAE 
ASPGSASGWG*GRPGGTSDMRR 
PPRGRRPAGVEGGRGEPATCGE 

AGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
SSAGDSGRFPRR 
AAQATQGASPAG 
QRRRLRALPPQV 



TCCTGCCTGAAGGAGCTG 240 
SerCysLeuLysGluLeu 80 

GTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGC 300 

Val AlaArgVal LeuGl nArgLeuCysGl uArgGlyAl aLysAsnVal LeuAlaPheGly 100 

TTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGC 360 

PheAlaLeuLeuAspGlyA'laArgGlyGlyProProGluAlaPheThrThrSerValArg 120 

AGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTG 420 

SerTyrLeuProAsnThrValThrAspAlaLeuArgGlySerGlyAlaTrpGlyLeuLeu 140 

TTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTG 480 

LeuArgArgVal GlyAspAspVal LeuVal HisLeuLeuAl aArgCysAl aLeuPheVal 160 

CTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCT 540 

LeuVal AlaProSerCysAlaTyrGlnValCysGlyProProLeuTyrGlnLeuGlyAl a 180 

GCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAA 600 

AlaThrGlnAlaArgProProProHisAlaSerGlyProArgArgArgLeuGlyCysGlu 200 



Fig. 11AB 



CGGGCCT66AACCATAGCGTCAG6GAGGCC6GGGTCCCCCTGGGCCTGCCAGCCCCGGGT 
ArgAl aTrpAsnHi sSerVal ArgGI uAl aGlyVal ProLeuGlyLeuProAl aProGly 

GCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGT 
AlaArgArgArgGlyGlySerAlaSerArgSerLeuProLeuProLysArgProArgArg 




GGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGC 780 

GlyAl aAl aProGl uProGl uArgThrProVal GlyGl nGlySerTrpAl aHi sProGly 260 

AGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAA 840 

ArgThrArgGlyProSerAspArgGlyPheCysVal Val SerProAl aArgProAl aGl u 280 

GAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGC 900 

Gl uAlaThrSerLeuGI uGl yAl aLeuSerGI yThrArgHi sSerHi sProSerVal Gl y 300 

CGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCCTGGGACACGCCT 960 

ArgGI nHisHisAl aGl yProProSerThrSerArgProProArgProTrpAspThrPro 320 

TGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAG 1020 

CysProProVal TyrAl aGl uThrLysHi sPheLeuTyrSerSerGlyAspLysGl uGl n 340 

CTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTC 1080 

LeuArgProSerPheLeuLeuSerSerLeuArgProSerLeuThrGlyAl aArgArgLeu 360 

GTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCC 1140 

ValGluThrllePheLeuGlySerArgProTrpMetProGlyThrProArgArgLeuPro 380 

CGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCAC 1200 

ArgLeuProGlnArgTyrTrpGlnMetArgProLeuPheLeuGluLeuLeuGlyAsnHis 400 

GCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACC 1260 

AlaGlnCysProTyrGlyVal LeuLeuLysThrHisCysProLeuArgAl aAlaValThr 420 

CCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAG 1320 

ProAl aAl aGlyVal CysAl aArgGl uLysProGl nGlySerVal Al aAl aProGl uGl u 440 

GAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAG 1380 

Gl uAspThrAspProArgArgLeuVal Gl nLeuLeuArgGl nHi sSerSerProTrpGl n 460 

GTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCC 1440 

Val TyrGlyPheVal ArgAl aCysLeuArgArgLeuVal ProProGlyLeuTrpGlySer 480 

AGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCAT 1500 

ArgHisAsnGluArgArgPheLeuArgAsnThrLysLysPhelleSerLeuGlyLysHis 500 

GCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGGCTGCGCTTGGCTG 1560 

AlaLysLeuSerLeuGlnGl uLeuThrTrpLysMetSerVal ArgAspCysAlaTrpLeu 520 



Fig. 11 AC 



CGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCA6AGCACCGTCTGCGTGAGGAGATC 1620 

ArgArgSerProGlyVal GlyCysVal ProAl aAl aGl uHi sArgLeuArgGI uGl ul 1 e 540 

CTGGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTC 1680 

LeuAlaLysPheLeuHisTrpLeuMet.SerValTyrValValGluLeuLeuArgSerPhe 560 




TTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTC 1740 

PheTyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerVal 580 

TGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAG 1800 

TrpSerLysLeuGlnSerll eGlylleArgGlnHi sLeuLysArgValGl nLeuArgGl u 600 

CTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGA 1860 

LeuSerGl uAl aGl uVal ArgGl nHi sArgGl uAl aArgProAl a LeuLeuThrSerArg 620 

CTCCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTG 1920 

LeuArgPhel 1 eProLysProAspGly LeuArgProIl eVal AsnHetAspTyrVal Val 640 

GGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCA 1980 

GlyAl aArgThrPheArgArgGI uLysArgAl aGl uArgLeuThrSerArgVal LysAla 660 

CTGTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTG 2040 

LeuPheSerVa 1 LeuAsnTyrGl uArgAl aArgArgProGlyLeuLeuGlyAlaSerVal 680 

CTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAG 2100 

LeuGlyLeuAspAspI 1 eHi sArgAl aTrpArgThrPheVal LeuArgVal ArgAl aGl n 700 

GACCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATC 2160 

AspProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrlle 720 

CCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGC 2220 

ProGl nAspArgLeuThrGl uVal II eAl aSer II ell eLysProGl nAsnThrTyrCys 740 

GTGCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAG 2280 

Val ArgArgTyrAlaVal Val Gl nLys Al aAl aHi sGlyHi sVal ArgLysAl aPheLys 760 

AGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 2340 

SerHisValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAlaHisLeu 780 

CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAG 2400 

Gl nGl uThrSerProLeuArgAspAl aVal Val II eGl uGlnSerSerSerLeuAsnGl u 800 

GCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 2460 

Al aSerSerGlyLeuPheAspVal PheLeuArgPheMetCysHi sHi sAl aVal Argil e 820 

AGGGGCAAGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTG 2520 

ArgGlyLysSerTyrVal GlnCysGl nGlyll eProGl nGlySerll eLeuSerThrLeu 840 



Fig. HAD 



CTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGAC 2580 

LeuCysSerLeuCysTyrGlyAspMetGluAsnLysLeuPheAlaGlylleArgArgAsp 860 

GGGCTGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCG 2640 

GlyLeuLeuLeuArgLeuValAspAspPheLeuLeuValThrProHisLeuThrHisAla 880 

AAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTG 2700 

LysThrPheLeuArgThrLeuValArgGlyValProGluTyrGlyCysValValAsnLeu 900 

CGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTT 2760 

ArgLysThrVal ValAsnPheProVal Gl uAspGl uAl aLeuGlyGlyThrAl aPheVal 920 

CAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTG 2820 

GlnMetProAlaHisGlyLeuPheProTrpCysGlyLeuLeuLeuAspThrArgThrLeu 940 

GAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTC 2880 

GluValGlnSerAspTyrSerSerTyrAlaArgThrSerlleArgAlaSerLeuThrPhe 960 

AACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTG 2940 

AsnArgGlyPheLysAl aGlyArgAsnMetArgArgLysLeuPheGlyVal LeuArgLeu 980 

AAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAAC 3000 

LysCysHisSerLeuPheLeuAspLeuGlnValAsnSerLeuGlnThrValCysThrAsn 1000 

ATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCA 3060 

II eTyrLysIl eLeuLeuLeuGl nAl aTy rArgPheHi sAl aCysVal LeuGl nLeuPro 1020 

TTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCC 3120 

PheHisGlnGlnValTrpLysAsnProThrPhePheLeuArgVallleSerAspThrAls 1040 

TCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGC 3180 

SerLeuCysTyrSerlleLeuLysAlaLysAsnAlaGlyMetSerLeuGlyAlaLysGly 1060 

GCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTC 3240 

AlaAlaGlyProLeuProSerGluAlaValGlnTrpLeuCysHisGlnAlaPheLeuLeu 1080 

AAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAG 3300 

LysLeuThrArgHisArgValThrTyrValProLeuLeuGlySerLeuArgThrAlaGln 1100 

ACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAAC 3360 

ThrGlnLeuSerArgLysLeuProGlyThrThrLeuThrAlaLeuGluAlaAlaAlaAsn 1120 

CCGGCACTGCCCTCAGACTTCAAGACCATCCTGGAC 3420 

ProAlaLeuProSerAspPheLysThrlleLeuAsp 1132 
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Truncated protein 3 (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCC6AGCCGTGC6CTCCCTGCTGC6CAGCCACTACCGCGAGGTGCTGCCGCTGGCCAC6TTCGTG 
HPRAPRCRAVRSLIRSHYREVLPIATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G H R L V Q R G 0 P A A F R A I V A Q C L V C V P H D A R P P P A A 



GGCCTCCCCGGGGTCGGCGTCC GGCT GGGGTTGAGGGCGGCCGGGGGGAAC CAGCGAC ATGCGGAG AGCAGCGC AGGCGACTC AGGGCGC TTCCCCC GCAGGTG 
G L P G V G V R L G t R A A G G N Q R H A E S S A G D S G R F P R R 
A S P G S A S G H G * G R P G G T S D H R R A A 0 A T Q G A S P A G 
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCIKELVARVIQRLCERGAKNVIAFGFAIIDGAR 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N I V T D A L R G S G A W G L L L R R V G D D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P I Y Q L G A A I Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGAIGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGR1RGPSDR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWOTPCPPVYAETKHFLYSSGDKEQLRPSFLLS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVET1FLGSRPWHPGTPRRLPRLPQRYWQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T 0 P R R L V Q L L R 0 H S S P H Q V Y G F V R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
L R R L V P P G L H G S R H N E R R F L R N I K K F 1 S L G K H A K L S L 0 £ L 



Fig. 11 AF 



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAG6GGTTGGCT6TGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAA6TTCCT6CACTG6CT 
THKMSVRDCAWLRRSPGVGCVPAAEHRLREE I LAKFLHHL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFfYRKSVHSKLQSIG 

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IROHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P 1 V N H 0 y V V G A R T F R R E K R A E R I T S R V K A L F S V I N T E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G I L G A S V I G L D D 1 H R A W R T f V L R V R A Q D P P P E L Y F 

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K V D V T G A Y 0 T I P Q D R L T E V 1 A S I I K P Q N T Y C V R R Y A V V Q 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
K A A H G H V R I A F K S H V S T L T D L Q P Y H R Q F ¥ A H L Q E T S P L R D 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
AVVIEQSSSLNEASSGLFDVFLRFHCHHAVRIRGKSYVQC 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G S 1 L S T L L C S I C Y G 0 H E N K L F A G I R R 0 G L L L R L V D 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
0 F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V H F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q H P A H G L F P H C G L L L D T R T L E V Q S 0 Y S R 



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 



TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA 



Fig. HAG 




APR 2 6 2DW w 

/ 



Altered C-terminus protein (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG 
MPRAPRCRAVRSLLRSHYREVLPLATFV 



CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G H R L V Q R G D P A A f R A L V A Q C L V C V P H 0 A R P P P A A 



6GCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGA66GCGGCC6G66GGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT6 
GLPGVGVRLGLRAAGGNQRHAESSAGDSGRFPRR 
A S P G S A S G H G * G R P G G T S 0 H R R A A 0 A T Q G A S P A G 
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E I V A R V l Q R L C E R G A K N V L A F G F A I L 0 G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V I D A L R G S G A W G L L L R R V G 0 D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
LVHLLARCALFVLVAPSCAYQVCGPPLYQLGAATQARPPP 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S I P L P K R P R R G A A P E P E R T P V G Q G S H A H P G R I R G P S D R 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W 0 T P C P P V Y A E I K H F I Y S S G D K E 0 L R P S F L L S 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETiFLGSRPWHPGTPRRLPRLPQRYHQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRIVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL 



Fig. 11 AH 



GAC6TGGAAGATGAGCGT6CG6GACTGC6CTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCT6CGTGAGGAGATCCTGGCCAA6TTCCTGCACTGGCT 
TWKMSVROCAHLRRSPGVGCVPAAEHRLRE E I LAKFLHHL 

GATGA6TGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC6GAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
M 5 V Y V V E I L R $ F F Y V T E T T F Q K N R L F f Y R K S V W S K I Q $ i G 

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRYQIREISEAEVRQHREARPAIITSRLRFIPKPD 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P 1 V N M 0 y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G I L G A S V I G L D 0 I H R A H R T F V L R V R A Q 0 P P P E L Y F 

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
VKVDVTGAYOTiPQDRlTEVIASIIKPQNTYCVRRYAVVQ 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTIIDLQPYMRQFVAHLOETSPLRO 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V M E Q S S S L N E A S S G L F D V F L R F H C H H A V R I R G K S Y V Q C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
0 G I P Q G S I L S T L L C S I C Y G D H E N K L F A G I R R 0 G L L I R L V D 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
0 F I I V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCT6GAGGTGCAGAGCGACTACTCCAG 
V E 0 E A L G G T A F V Q H P A H G L f P W C G L L L D T R T L E V Q S 0 Y S S 

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
YARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLD 

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
LQVNSLQTVCTNIYKILLLQAYRFHACVL-QLPFHQQVWKN 

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
P T F F L R V I S 0 T A S L C Y S I L K A K N A E 



E E N I I V V T P A V l G S 

GGGACAGCCAGAGATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
G Q P E H E P P R R P S G V G S F P Y S P G R G V G L G L * 



Fig. 11AI 




Protein that lacks motif A (ver. 2) 



ATGCC6C6CGCTCCCC6CTGCCGA6CCGTGCGCTCCCTGCTGCGCAGCCACTACC6CGAGGT6CTGCCGCTGGCCAC6TTCGTG 
MPRAPRCRAVRSLLRSHYREVLPIATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G H R L V Q R G 0 P A A F R A L V A Q C L V C V P W 0 A R P P P A A 



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
GIPGVGVRLGIRAAGGNQRHAESSAGDSGRFPRR 
A S P G S A S G W G * G R P G G T S 0 H R R A A Q A T Q G A S P A G 
P P R G R R P A G V E G G R G E P A T C G E Q R R R L R A L P P Q V 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKEIVARVLQRICERGAKNVLAFGFALLOGAR 

CG6GGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T 0 A L R G S G A H G L L L R R V G D 0 V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q I G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G f C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S I S R P P R P W 0 T P C P P V Y A E T K H F L Y 5 S G 0 K E Q L R P S F L L S 

CTCTCTGAGGCCCAGCCTGACTGGCGC TCGGAGG CTC GTGGAGACC ATCTTTC TGGGTTCC AGGCCCTGGATGCCAGGGAC T CCCCGC AGGTTGC CCCGCCTGC C C C AGCGCT ACTGGCA 
S L R P S L T G A R R L V E I I F L G S R P W H P G T P R R L P R L P Q R Y W Q 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q 6 S V A A P E E E 0 T D P R R I V Q L L R Q H $ S P « Q V Y G F V R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGMGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
L R « L V P P G I H G $ R H N E R R F L R N T K K F I S L G K H A K L S L Q E L 



Fig. 11 A J 



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGG6TTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGT6AGGAGATCCT66CCAAGTTCCTGCACTGGCT 
TWKMSVROCAWLRRSPGVGCVPAAEHRLREE I LAKFLHWL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
M S V Y V V f L L R 5 F F Y V T E T T F Q K N R I f F Y R K S V H S K L Q S I G 

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
i R Q H L K R V Q L R E L S E A E V R Q H R E A R P A L L T S R L R F I P K P 0 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N H 0 Y V V G A R T F R R E K R A E R L I S R V K A L F S V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G I L G A S V L G L 0 D I H R A H R T F V L R V R A Q D P P P E L Y F 

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K 0 R L T E V I A S 1 I K P 0 N T Y C V R R Y A V V Q 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
K A A H G H ¥ R K A F I S H V S T L T 0 L Q P Y'M R Q F V A H L Q E T S P L R 0 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F 0 V F L R F N C H H A V R I R G K S Y V Q C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QGIPQGSILSIILCSLCYGDHENKLFAGIRRDGLLLRLVO 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
0 F L I V T P H L T H A K T F I R T L V R G V P E Y G C V V N L R K I V V N F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q H P A H G L F P H C G I L I D T R T L E V Q S 0 Y S S 

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
YARTSIRASlTFNRGFKAGRNMRRKlfGVLRLKCHSlFLD 

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
L Q V N S L Q T V C T N I Y K I I L L Q A Y R F H A C V L 0 L P F H 0 0 V W K N 

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
P T F F L R V I S 0 T A S L C Y S I L K A K N A G H S L G A K G A A G P L P S E 

GGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCC 
AVQHLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKIP 

GGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCC 
G T T L T A I E A A A N P A L P S D F K T I L D 



Fig. 11 AK 



CTGTCACGCCG6GCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGA6TGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCT 
GAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCAC 
CAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAA 
GGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAA 
AATACTGAATATATGAGTTTTTCAGTTTTGA 



Fig. HAL 



Truncated protein that lacks motif A (ver. 2) 



ATGCC6CGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGT6 
N P R A P R C R A ¥ R S L L R S H Y R E ¥ L P L A T F ¥ 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R I G P Q G H R L V Q R G D P A A F R A L ¥ A Q C L V C V P W D A R P P P A A 



GGCCTCCCCGGGGTCGGCGTCCGGCT6GGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G L P G ¥ G ¥ R L G L R A A G G N Q<R H A E S S A G D S G R F P R R 
A S P G S A S G W G * G R P G G I S 0 H R R A A Q A T Q G A S P A G 

P P R G R R P A G ¥ E G G R G E P A T C G E Q R R R 'l R A L P P 0 ¥ 

CCCCTCCTTCCGC C AGGTGTCCTGCC TGAAGGAGC TGGTGGC CCGAGTGC TGCAGAGGC TGTGCG AGCGCGGCGCGAAGAACGTGCTGGC CTTCGGCT TCGCGC TGCTGGACGGGGCCCG 
P 5 F R Q ¥ S C L K E L ¥ A R ¥ L Q R L C E R G A K N ¥ L A F G F A L L 0 G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S ¥ R S ¥ L P N T ¥ T 0 A L R G S G A H G L L L R R V G 0 D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L » H L L A R C A L F ¥ L ¥ A P S C A ¥ Q ¥■ C G P P L ¥ Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
" A S G P R R R L G C E R A W N H S ¥ R E A G ¥ P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCA6GCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P I P K R P R R G A A P E P E R T P ¥ G Q G S y A H P G R T R G P S D R 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C ¥ V S P A R P A E E A T S L E G A I S G T R H S H P S ¥ G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
$TSRPPRPHDTPCPP¥YAETKHFLYSSGDKEOLRPSFLl$ 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
s I R P $ L T G A R R L ¥ E T i F L G $ R P W H P G T P R R L P R L P Q R Y H Q 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
H R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A ¥ T P A A G ¥ C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E 1 p Q G S ¥ A A P E E E 0 T 0 P R R L V Q L L R Q H $ $ P H Q ¥ Y G F ¥ R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
" " L V P P G I W G $ R H N E R R F L R N T K K F I S L G K H A K L $ L Q E I 



Fig. 11AM 



GACGTGGAAGATGAGCGTGCG6GACTGCGCTTGGCTGCGCAGGAGCCCAGGG6TTGGCTGTGTTCCGGCCGCAGAGCACCGTCT6CGT6AGGAGATCCTGGCCAAGTTCCT6CACTGGCT 
THKMSVROCAHLRRSPGVGCVPAAEHRLREE I LAKFLHHL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
M S V Y V V E I l R $ f F Y V T E T T F Q K N R L F F Y R K S V W $ K I Q S 1 G 

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
i R Q H I K R V 0 L R £ L S E A E V R Q H R E A R P A L L T S R L R F I P K P 0 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N H 0 y V V G A R T F R R E K R A E R L T $ R V K A L F 5 V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGA7ATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V I G I D D I H R A H R T f V L R V R A Q 0 P P P E L Y F 

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K 0 R L T E V 1 A $ I i K P Q N T Y C V R R Y A V V Q 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTlTDLQPYMRQfVAHLQETSPLRD 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S $ L H E A S S G L F 0 V F L R F H C H H A V R 1. R G K S Y V 0 C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
0 G I P Q G S I L S T L L C S I C Y G D H E N K I F A G I R R 0 G L I L R L V 0 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
0 F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K I V V N F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q H P A H G L F P H C G L L L D T R T L E V Q S D Y S R 



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 



TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA. 



Fig. 11 AN 




Lacks motif A and altered C-terminus (ver. 2) 



AT6CCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCA6CCACTACCGCGAGGTGCTGCCGCTGGCCACGTTC6T6 
H P R A P R C R A V R S L L R S H Y R £ V I P I A T f V 



CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G H R I V Q R G 0 P A A f R A L V A Q C L V C V P W 0 A R P P P A A 



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
GLPGVGVRLGLRAAGGNORHAESSAGOSGRFPRR 
A S P G S A S G W G * G R P G G T S D H R R A A Q A T Q G A S P A G 
P P R G R R P A G V E G G R G E P A T C G E Q R R R L R A L P P Q V 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCMCACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T I S V R S Y L P N T V T 0 A L R G S G A H G L L L R R V G 0 0 V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T 0 A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G I P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
5 R S I P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S 0 R 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S I E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S f S R P P R P H 0 T P C P P V Y A E T K H f L Y S S G D K E Q L R P S F L L S 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R p S L T G A R R L V E T I F L G S R P H H P G T P R R L P R L P Q R Y W Q 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
H R P L f L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRlVQLLRQHSSPWQVYGfVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
1 R R L v p p 6 L W G S R H N E R R F L R N T IK F I S L G K H A K I S L 0 [ L 

Fig. 11AO 



6ACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAG6A6CCCAG6GGTT6GCTGTGTTCCGGCC6CAGA6CACC6TCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAHLRRSPGVGCVPAAEHRLREE I LAKFLHWL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
H S V y V V E L L R S F F Y V T E T T F Q K N R I F F Y R K S V W S K I Q S 1 G 

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
I R Q H L K R V Q I R E I S E A E V R 0 H R E A R P A L L T S R L R F I P K P 0 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N M D Y V V G A R T F R R E' K R A E R L T $ R V K A L F S V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A $ V L G L D 0 I H R A If R T f V L R V R A Q D P P P E L Y F 

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
v K 0RLTEVIAS1IKPQNTYCVRRYAVVQ 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
« A A H G H V R K A F M H V S T L T 0 L Q P Y H R 0 F V A H L 0 E T S P L R 0 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V 1 E Q S S S L N E A S S G L F D V F L R F N C H H A V R I R G K S Y V fl C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G $ I L S T I I C $ L C Y G 0 M E N K L F A G I R R D G L I I R L V D 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
0 F L L V T P H L T H A K I F L R T I V R G V P E Y G C V V N L R K T V V N F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 
V E D E A L G G T A F V Q H P A H G L F P H C G L L L 0 T R T L E V Q S D Y S $ 

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
YARTSIRASLTFNRGFKAGRNHRRKLFGVLRLKCHSIFID 

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
L 0 V N S I Q T V C T N I Y K I I L L Q A Y R F H A C V L Q L P F H Q Q V H K N 

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
P T F F L R V I S 0 T A $ L C Y $ I L K A K N A E 



EENILVVTPAVLGS 

GGGACAGCCAGAGATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
6 Q p F- H E P P R R P S G V G S F P V $ P G R G V G L G L * 
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Fig. 13A 



LOCUS pAKI28.4 7615 bp dsDNA Circular 

DEFINITION Human telomerase clone with exon beta spliced out 
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tcgacctgca 
aaccctggcg 
aatagcgaag 
tggcgcctga 
tccctgtttt 
aagcggtctg 
catgccgaac 
gagagtaggg 
ttcgttttat 
cggatttgaa 
ctgccaggca 
aaactcttcc 
gtttttgcat 
ccgcatagtt 
gtctgctccc 
agaggttttc 
ttttataggt 
ccatcgaatg 
tcagggtggt 
cttatcagac 
aaaaagtgga 
tggcgggcaa 
cgtcgcaaat 
tgtcgatggt 
cgcaacgcgt 
tggaagctgc 
tcaacagtat 
cattgggtca 
tgcgtctggc 
gggaaggcga 
gcatcgttcc 
ccattaccga 
ccgaagacag 
tggggcaaac 
atcagctgtt 
ccgcctctcc 
tggaaagcgg 
caggctttac 
ggaaatgtgc 
ctcatgagac 
attcaacatt 
gctcacccag 
ggttacatcg 
aacgttttcc 
ttgacgccgg 



ggcatgcaag 
ttacccaact 
aggcccgcac 
tgcggtattt 
ggcggatgag 
ataaaacaga 
tcagaagtga 
aactgccagg 
ctgttgtttg 
cgttgcgaag 
tcaaattaag 
tgtcgtcata 
caggaaagca 
aagccagccc 
ggcatccgct 
accgtcatca 
taatgtcatg 
gtgcaaaacc 
gaatgtgaaa 
cgtttcccgc 
agcggcgatg 
acagtcgttg 
tgtcgcggcg 
agaacgaagc 
cagtgggctg 
ctgcactaat 
tattttctcc 
ccagcaaatc 
tggctggcat 
ctggagtgcc 
cactgcgatg 
gtccgggctg 
ctcatgttat 
cagcgtggac 
gcccgtctca 
ccgcgcgttg 
gcagtgagcg 
actttatgct 
gcggaacccc 
aataaccctg 
tccgtgtcgc 
aaacgctggt 
agaactggat 
aatgatgagc 
gcaagagcaa 



cttggcactg 
taatcgcctt 
cgatcgccct 
tctccttacg 
agaagatttt 
atttgcctgg 
aacgccgtag 
catcaaataa 
tcggtgaacg 
caacggcccg 
cagaaggcca 
tctacaagcc 
gggaatttat 
cgacacccgc 
tacagacaag 
ccgaaacgcg 
ataataatgg 
tttcgcggta 
ccagtaacgt 
gtggtgaacc 
gcggagctga 
ctgattggcg 
attaaatctc 
ggcgtcgaag 
atcattaact 
gttccggcgt 
catgaagacg 
gcgctgttag 
aaatatctca 
atgtccggtt 
ctggttgcca 
cgcgttggtg 
atcccgccgt 
cgcttgctgc 
ctggtgaaaa 
gccgattcat 
caacgcaatt 
tccgacctgc 
tatttgttta 
ataaatgctt 
ccttattccc 
gaaagtaaaa 
ctcaacagcg 
acttttaaag 
ctcggtcgcc 



gccgtcgttt 
gcagcacatc 
tcccaacagt 
catctgtgcg 
cagcctgata 
cggcagtagc 
cgccgatggt 
aacgaaaggc 
ctctcctgag 
gagggtggcg 
tcctgacgga 
atccccccac 
ggtgcactct 
caacacccgc 
ctgtgaccgt 
cgagacgaaa 
tttcttagac 
tggcatgata 
tatacgatgt 
aggccagcca 
attacattcc 
ttgccacctc 
gcgccgatca 
cctgtaaagc 
atccgctgga 
tatttcttga 
gtacgcgact 
cgggcccatt 
ctcgcaatca 
ttcaacaaac 
acgatcagat 
cggatatctc 
taaccaccat 
aactctctca 
gaaaaaccac 
taatgcagct 
aatgtaagtt 
aagaacctca 
tttttctaaa 
caataatatt 
ttttttgcgg 
gatgctgaag 
gtaagatcct 
ttctgctatg 
gcatacacta 



tacaacgtcg 
cccctttcgc 
tgcgcagcct 
gtatttcaca 
cagattaaat 
gcggtggtcc 
agtgtggggt 
tcagtcgaaa 
taggacaaat 
ggcaggacgc 
tggccttttt 
agatacggta 
cagtacaatc 
tgacgcgccc 
ctccgggagc 
gggcctcgtg 
gtgaggttct 
gcgcccggaa 
cgcagagtat 
cgtttctgcg 
caaccgcgtg 
cagtctggcc 
actgggtgcc 
ggcggtgcac 
tgaccaggat 
tgtctctgac 
gggcgtggag 
aagttctgtc 
aattcagccg 
catgcaaatg 
ggcgctgggc 
ggtagtggga 
caaacaggat 
gggccaggcg 
cctggcgccc 
ggcacgacag 
agctcactca 
cgtcaggtgg 
tacattcaaa 
gaaaaaggaa 
cattttgcct 
atcagttggg 
tgagagtttt 
tggcgcggta 
ttctcagaat 



tgactgggaa 
cagctggcgt 
gaatggcgaa 
ccgcataaat 
cagaacgcag 
cacctgaccc 
ctccccatgc 
gactgggcct 
ccgccgggag 
ccgccataaa 
gcgtttctac 
aactagcctc 
tgctctgatg 
tgacgggctt 
tgcatgtgtc 
atacgcctat 
gtacccgaca 
gagagtcaat 
gccggtgtct 
aaaacgcggg 
gcacaacaac 
ctgcacgcgc 
agcgtggtgg 
aatcttctcg 
gccattgctg 
cagacaccca 
catctggtcg 
tcggcgcgtc 
atagcggaac 
ctgaatgagg 
gcaatgcgcg 
tacgacgata 
tttcgcctgc 
gtgaagggca 
aatacgcaaa 
gtttcccgac 
ttaggcaccc 
cacttttcgg 
tatgtatccg 
gagtatgagt 
tcctgttttt 
tgcacgagtg 
cgccccgaag 
ttatcccgta 
gacttggttg 



Fig. 13B 



2701 agtactcacc agtcacagaa aagcatctta 
2761 gtgctgccat aaccatgagt gataacactg 
2821 gaccgaagga gctaaccgct tttttgcaca 
2881 gttgggaacc ggagctgaat gaagccatac 
2941 tagcaatggc aacaacgttg cgcaaactat 
3001 ggcaacaatt aatagactgg atggaggcgg 
3061 cccttccggc tggctggttt attgctgata 
3121 gtatcattgc agcactgggg ccagatggta 
3181 cggggagtca ggcaactatg gatgaacgaa 
3241 tgattaagca ttggtaactg tcagaccaag 
3301 aacttcattt ttaatttaaa aggatctagg 
3361 aaatccctta acgtgagttt tcgttccact 
3421 gatcttcttg agatcctttt tttctgcgcg 
3481 cgctaccagc ggtggtttgt ttgccggatc 
3541 ctggcttcag cagagcgcag ataccaaata 
3601 accacttcaa gaactctgta gcaccgccta 
3661 tggctgctgc cagtggcgat aagtcgtgtc 
3721 cggataaggc gcagcggtcg ggctgaacgg 
3781 gaacgaccta caccgaactg agatacctac 
3841 ccgaagggag aaagg.cggac aggtatccgg 
3901 cgagggagct tccaggggga aacgcctggt 
3961 tctgacttga gcgtcgattt ttgtgatgct 
4021 ccagcaacgc ggccttttta cggttcctgg 
4081 ttcctgcgtt atcccctgat tctgtggata 
4141 ccgctcgccg cagccgaacg accgagcgca 
4201 gcccaatacg caaaccgcct ctccccgcgc 
4261 ctcatgtttg acagcttatc atcgactgca 
4321 ccatcggaag ctgtggtatg gctgtgcagg 
4381 aaggcgcact cccgttctgg ataatgtttt 
4441 tattctgaaa tgagctgttg acaattaatc 
4501 cggataacaa tttcacacag gaaacagcga 
4561 tggcccgagt gctgcagagg ctgtgcgagc 
4621 tcgcgctgct ggacggggcc cgcgggggcc 
4681. gctacctgcc caacacggtg accgacgcac 
4741 tgcgccgcgt gggcgacgac gtgctggttc 
4801 tggtggctcc cagctgcgcc taccaggtgt 
4861 ccactcaggc ccggcccccg ccacacgcta 
4921 gggcctggaa ccatagcgtc agggaggccg 
4981 cgaggaggcg cgggggcagt gccagccgaa 
5041 gcgctgcccc tgagccggag cggacgcccg 
5101 ggacgcgtgg accgagtgac cgtggtttct 
5161 aagccacctc tttggagggt gcgctctctg 
5221 gccagcacca cgcgggcccc ccatccacat 
5281 gtcccccggt gtacgccgag accaagcact 
5341 tgcggccctc cttcctactc agctctctga 
5401 tggagaccat ctttctgggt tccaggccct 
5461 gcctgcccca gcgctactgg caaatgcggc 
5521 cgcagtgccc ctacggggtg ctcctcaaga 



cggatggcat gacagtaaga gaattatgca 
cggccaactt acttctgaca acgatcggag 
acatggggga tcatgtaact cgccttgatc 
caaacgacga gcgtgacacc acgatgcctg 
taactggcga actacttact ctagcttccc 
ataaagttgc aggaccactt ctgcgctcgg 
aatctggagc cggtgagcgt gggtctcgcg 
agccctcccg tatcgtagtt atctacacga 
atagacagat cgctgagata ggtgcctcac 
tttactcata tatactttag attgatttaa . 
tgaagatcct ttttgataat ctcatgacca 
gagcgtcaga ccccgtagaa aagatcaaag 
taatctgctg cttgcaaaca aaaaaaccac 
aagagctacc aactcttttt ccgaaggtaa 
ctgtccttct agtgtagccg tagttaggcc 
catacctcgc tctgctaatc ctgttaccag 
ttaccgggtt ggactcaaga cgatagttac 
ggggttcgtg cacacagccc agcttggagc 
agcgtgagca ttgagaaagc gccacgcttc 
taagcggcag ggtcggaaca ggagagcgca 
atctttatag tcctgtcggg tttcgccacc 
cgtcaggggg gcggagccta tggaaaaacg 
ccttttgctg gccttttgct cacatgttct 
accgtattac cgcctttgag tgagctgata 
gcgagtcagt gagcgaggaa gcggaagagc 
gttggccgat tcattaatgc agaattaatt 
cggtgcacca atgcttctgg cgtcaggcag 
tcgtaaatca ctgcataatt cgtgtcgctc 
ttgcgccgac atcataacgg ttctggcaaa 
atcggctcgt ataatgtgtg gaattgtgag 
tgaattcaga tctcaccatg aaggagctgg 
gcggcgcgaa gaacgtgctg gccttcggct 
cccccgaggc cttcaccacc agcgtgcgca 
tgcgggggag cggggcgtgg gggctgctgc 
acctgctggc acgctgcgcg ctctttgtgc 
gcgggccgcc gctgtaccag ctcggcgctg 
gtggaccccg aaggcgtctg ggatgcgaac 
gggtccccct gggcctgcca gccccgggtg 
gtctgccgtt gcccaagagg cccaggcgtg 
ttgggcaggg gtcctgggcc cacccgggca 
gtgtggtgtc acctgccaga cccgccgaag 
gcacgcgcca ctcccaccca tccgtgggcc 
cgcggccacc acgtccctgg gacacgcctt 
tcctctactc ctcaggcgac aaggagcagc 
ggcccagcct gactggcgct cggaggctcg 
ggatgccagg gactccccgc aggttgcccc 
ccctgtttct ggagctgctt gggaaccacg 
cgcactgccc gctgcgagct gcggtcaccc 

Fig. 13C 



cagcagccgg tgtctgtgcc cgggagaagc cccagggctc tgtggcggcc cccgaggagg 
aggacacaga cccccgtcgc ctggtgcagc tgctccgcca gcacagcagc ccctggcagg 
tgtacggctt cgtgcgggcc tgcctgcgcc ggctggtgcc cccaggcctc tggggctcca 
ggcacaacga acgccgcttc ctcaggaaca ccaagaagtt catctccctg gggaagcatg 
ccaagctctc gctgcaggag ctgacgtgga agatgagcgt gcgggactgc gcttggctgc 
gcaggagccc aggggttggc tgtgttccgg ccgcagagca ccgtctgcgt gaggagatcc 
tggccaagtt cctgcactgg ctgatgagtg tgtacgtcgt cgagctgctc aggtctttct 
tttatgtcac ggagaccacg tttcaaaaga acaggctctt tttctaccgg aagagtgtct 
ggagcaagtt gcaaagcatt ggaatcagac agcacttgaa gagggtgcag ctgcgggagc 
tgtcggaagc agaggtcagg cagcatcggg aagccaggcc cgccctgctg acgtccagac 
tccgcttcat ccccaagcct gacgggctgc ggccgattgt gaacatggac tacgtcgtgg 
gagccagaac gttccgcaga gaaaagaggg ccgagcgtct cacctcgagg gtgaaggcac 
tgttcagcgt gctcaactac gagcgggcgc ggcgccccgg cctcctgggc gcctctgtgc 
tgggcctgga cgatatccac agggcctggc gcaccttcgt gctgcgtgtg cgggcccagg 
acccgccgcc tgagctgtac tttgtcaagg tggatgtgac gggcgcgtac gacaccatcc 
cccaggacag gctcacggag gtcatcgcca gcatcatcaa accccagaac acgtactgcg 
tgcgtcggta tgccgtggtc cagaaggccg cccatgggca cgtccgcaag gccttcaaga 
gccacgtcct acgtccagtg ccaggggatc ccgcagggct ccatcctctc cacgctgctc 
tgcagcctgt gctacggcga catggagaac aagctgtttg cggggattcg gcgggacggg 
ctgctcctgc gtttggtgga tgatttcttg ttggtgacac ctcacctcac ccacgcgaaa 
acttcctcag gacctggtcc gaagtgtcct gagtatggct gcgtggtgaa cttgcggaag 
acagtggtga acttccctgt agaagacgaa gccctgggtg gcacggcttt tgttcagatg 
ccggcccacg gcctattccc ctggtgcggc ctgctgctgg atacccggac cctggaggtg 
cagagcgact actccagcta tgcccggacc tccatcagag ccagtctcac cttcaaccgc 
ggcttcaagg ctgggaggaa catgcgtcgc aaactctttg gggtcttgcg gctgaagtgt 
cacagcctgt ttctggattt gcaggtgaac agcctccaga cggtgtgcac caacatctac 
aagatcctcc tgctgcaggc gtacaggttt cacgcatgtg tgctgcagct cccatttcat 
cagcaagttt ggaagaaccc cacatttttc ctgcgcgtca tctctgacac ggcctccctc 
tgctactcca tcctgaaagc caagaacgca gccgaagaaa acatttctgt cgtgactcct 
gcggtgcttg ggtcgggaca gccagagatg gagccacccc gcagaccgtc gggtgtgggc 
agctttccgg, tgtctcctgg gaggggagtt gggctgggcc tgtgactcct cagcctctgt 
tt.tcccccag ggatgtcgct gggggccaag ggcgccgccg gccctctgcc ctccgaggcc 
gtgcagtggc tgtgccacca agcattcctg ctcaagctga ctcgacaccg tgtcacctac 
gtgccactcc tggggtcact caggacaggc aagtgtgggt ggaggccagt gcggg 
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LOCUS pAKI28.7 7797 bp dsDNA Circular 

DEFINITION Human telomerase clone with alternative C-terminus 
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tcgacctgca ggcatgcaag cttggcactg gccgtcgttt tacaacgtcg tgactgggaa 
aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 
aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 
tggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcataaat 
tccctgtttt ggcggatgag agaagatttt cagcctgata cagattaaat cagaacgcag 
aagcggtctg ataaaacaga atttgcctgg cggcagtagc gcggtggtcc cacctgaccc 
catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc 
gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct 
ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag taggacaaat ccgccgggag 
cggatttgaa cgttgcgaag caacggcccg gagggtggcg ggcaggacgc ccgccataaa 
ctgccaggca tcaaattaag cagaaggcca tcctgacgga tggccttttt gcgtttctac 
aaactcttcc tgtcgtcata tctacaagcc atccccccac agatacggta aactagcctc 
gtttttgcat caggaaagca gggaatttat ggtgcactct cagta.caatc tgctctgatg 
ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 
gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 
agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 
ttttataggt taatgtcatg ataataatgg tttcttagac gtgaggttct gtacccgaca 
ccatcgaatg gtgcaaaacc tttcgcggta tggcatgata gcgcccggaa gagagtcaat 
tcagggtggt gaatgtgaaa ccagtaacgt tatacgatgt cgcagagtat gccggtgtct 
cttatcagac cgtttcccgc gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg 
aaaaagtgga agcggcgatg gcggagctga attacattcc caaccgcgtg gcacaacaac 
tggcgggcaa acagtcgttg ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc 
cgtcgcaaat tgtcgcggcg attaaatctc gcgccgatca actgggtgcc agcgtggtgg 
tgtcgatggt agaacgaagc ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg 
cgcaacgcgt cagtgggctg atcattaact atccgctgga tgaccaggat gccattgctg 
tggaagctgc ctgcactaat gttccggcgt tatttcttga tgtctctgac cagacaccca 
tcaacagtat tattttctcc catgaagacg gtacgcgact gggcgtggag catctggtcg 
cattgggtca ccagcaaatc gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc 
tgcgtctggc tggctggcat aaatatctca ctcgcaatca aattcagccg atagcggaac 
gggaaggcga ctggagtgcc atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg 
gcatcgttcc cactgcgatg ctggttgcca acgatcagat ggcgctgggc gcaatgcgcg 
ccattaccga gtccgggctg cgcgttggtg cggatatctc ggtagtggga tacgacgata 
ccgaagacag ctcatgttat atcccgccgt taaccaccat caaacaggat tttcgcctgc 
tggggcaaac cagcgtggac cgcttgctgc aactctctca gggccaggcg gtgaagggca 
atcagctgtt gcccgtctca ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa 
ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 
tggaaagcgg gcagtgagcg caacgcaatt aatgtaagtt agctcactca ttaggcaccc 
caggctttac actttatgct tccgacctgc aagaacctca cgtcaggtgg cacttttcgg 
ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg 
ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt 
attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt 
gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg 
ggttacatcg agaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 
aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 
ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 



Fig. 14B 



2701 agtactcacc agtcacagaa aagcatctta 
2761 gtgctgccat aaccatgagt gataacactg 
2821 gaccgaagga gctaaccgct tttttgcaca 
2881 gttgggaacc ggagctgaat gaagccatac 
2941 tagcaatggc aacaacgttg cgcaaactat 
3001 ggcaacaatt aatagactgg atggaggcgg 
3061 cccttccggc tggctggttt attgctgata 
3121 gtatcattgc agcactgggg ccagatggta 
3181 cggggagtca ggcaactatg gatgaacgaa 
3241 tgattaagca ttggtaactg tcagaccaag 
3301 aacttcattt ttaatttaaa aggatctagg 
3361 aaatccctta acgtgagttt tcgttccact 
3421 gatcttcttg agatcctttt tttctgcgcg 
3481 cgctaccagc ggtggtttgt ttgccggatc 
3541 ctggcttcag cagagcgcag ataccaaata 
3601 accacttcaa gaactctgta gcaccgccta 
3661 tggctgctgc cagtggcgat aagtcgtgtc 
3721 cggataaggc gcagcggtcg ggctgaacgg 
3781 gaacgaccta caccgaactg agatacctac 
3841 ccgaagggag aaaggcggac aggtatccgg 
3901 cgagggagct tccaggggga aacgcctggt 
3961 tctgacttga gcgtcgattt ttgtgatgct 
4021 ccagcaacgc ggccttttta cggttcctgg 
4081 ttcctgcgtt atcccctgat tctgtggata 
4141 ccgctcgccg cagccgaacg accgagcgca 
4201 gcccaatacg caaaccgcct ctccccgcgc 
4261 ctcatgtttg acagcttatc atcgactgca 
4321 ccatcggaag ctgtggtatg gctgtgcagg 
4381 aaggcgcact cccgttctgg ataatgtttt 
4441 tattctgaaa tgagctgttg acaattaatc 
4501 cggataacaa tttcacacag gaaacagcga 
4561 tggcccgagt gctgcagagg ctgtgcgagc 
4621 tcgcgctgct ggacggggcc cgcgggggcc 
4681 gctacctgcc caacacggtg accgacgcac 
4741 tgcgccgcgt gggcgacgac gtgctggttc 
4801 tggtggctcc cagctgcgcc taccaggtgt 
4861 ccactcaggc ccggcccccg ccacacgcta 
4921 gggcctggaa ccatagcgtc agggaggccg 
4981 cgaggaggcg cgggggcagt gccagccgaa 
5041 gcgctgcccc tgagccggag cggacgcccg 
5101 ggacgcgtgg accgagtgac cgtggtttct 
5161 aagccacctc tttggagggt gcgctctctg 
5221 gccagcacca cgcgggcccc ccatccacat 
5281 gtcccccggt gtacgccgag accaagcact 
5341 tgcggccctc cttcctactc agctctctga 
5401 tggagaccat ctttctgggt tccaggccct 
5461 gcctgcccca gcgctactgg caaatgcggc 
5521 cgcagtgccc ctacggggtg ctcctcaaga 



cggatggcat gacagtaaga gaattatgca 
cggccaactt acttctgaca acgatcggag 
acatggggga tcatgtaact cgccttgatc 
caaacgacga gcgtgacacc acgatgcctg 
taactggcga actacttact ctagcttccc 
ataaagttgc aggaccactt ctgcgctcgg 
aatctggagc cggtgagcgt gggtctcgcg 
agccctcccg tatcgtagtt atctacacga 
atagacagat cgctgagata ggtgcctcac 
tttactcata tatactttag attgatttaa 
tgaagatcct ttttgataat ctcatgacca 
gagcgtcaga ccccgtagaa aagatcaaag 
taatctgctg cttgcaaaca aaaaaaccac 
aagagctacc aactcttttt ccgaaggtaa 
ctgtccttct agtgtagccg tagttaggcc 
catacctcgc tctgctaatc ctgttaccag 
ttaccgggtt ggactcaaga cgatagttac 
ggggttcgtg cacacagccc agcttggagc 
agcgtgagca ttgagaaagc gccacgcttc 
taagcggcag ggtcggaaca ggagagcgca 
atctttatag tcctgtcggg tttcgccacc 
cgtcaggggg gcggagccta tggaaaaacg 
ccttttgctg gccttttgct cacatgttct 
accgtattac cgcctttgag tgagctgata 
gcgagtcagt gagcgaggaa gcggaagagc 
gttggccgat tcattaatgc agaattaatt 
cggtgcacca atgcttctgg cgtcaggcag 
tcgtaaatca ctgcataatt cgtgtcgctc 
ttgcgccgac atcataacgg ttctggcaaa 
atcggctcgt ataatgtgtg gaattgtgag 
tgaattcaga tctcaccatg aaggagctgg 
gcggcgcgaa gaacgtgctg gccttcggct 
cccccgaggc cttcaccacc agcgtgcgca 
tgcgggggag cggggcgtgg gggctgctgc 
acctgctggc acgctgcgcg ctctttgtgc 
gcgggccgcc gctgtaccag ctcggcgctg 
gtggaccccg aaggcgtctg ggatgcgaac 
gggtccccct gggcctgcca gccccgggtg 
gtctgccgtt gcccaagagg cccaggcgtg 
ttgggcaggg gtcctgggcc cacccgggca 
gtgtggtgtc acctgccaga cccgccgaag 
gcacgcgcca ctcccaccca tccgtgggcc 
cgcggccacc acgtccctgg gacacgcctt 
tcctctactc ctcaggcgac aaggagcagc 
ggcccagcct gactggcgct cggaggctcg 
ggatgccagg gactccccgc aggttgcccc 
ccctgtttct ggagctgctt gggaaccacg 
cgcactgccc gctgcgagct gcggtcaccc 
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5581 cagcagccgg tgtctgtgcc cgggagaagc 
5641 aggacacaga cccccgtcgc ctggtgcagc 
5701 tgtacggctt cgtgcgggcc tgcctgcgcc 
5761 ggcacaacga acgccgcttc ctcaggaaca 
5821 ccaagctctc gctgcaggag ctgacgtgga 
5881 gcaggagccc aggggttggc tgtgttccgg 
5941 tggccaagtt cctgcactgg ctgatgagtg 
6001 tttatgtcac ggagaccacg tttcaaaaga 
6061 ggagcaagtt gcaaagcatt ggaatcagac 
6121 tgtcggaagc agaggtcagg cagcatcggg 
6181 tccgcttcat ccccaagcct gacgggctgc 
6241 gagccagaac gttccgcaga gaaaagaggg 
6301 tgttcagcgt gctcaactac gagcgggcgc 
6361 tgggcctgga cgatatccac agggcctggc 
6421 acccgccgcc tgagctgtac tttgtcaagg 
6481 cccaggacag gctcacggag gtcatcgcca 
6541 tgcgtcggta tgccgtggtc cagaaggccg 
6601 gccacgtctc taccttgaca gacctccagc 
6661 aggagaccag cccgctgagg gatgccgtcg 
6721 ccagcagtgg cctcttcgac gtcttcctac 
6781 ggggcaagtc ctacgtccag tgccagggga 
6841 tctgcagcct gtgctacggc gacatggaga 
6901 ggctgctcct gcgtttggtg gatgatttct 
6961 aaacttcctc aggacctggt ccgaagtgtc 
7021 agacagtggt gaacttccct gtagaagacg 
7081 tgccggccca cggcctattc ccctggtgcg 
7141 tgcagagcga ctactccagc tatgcccgga 
7201 gcggcttcaa ggctgggagg aacatgcgtc 
7261 gtcacagcct gtttctggat ttgcaggtga 
7321 acaagatcct cctgctgcag gcgtacaggt 
7381 atcagcaagt ttggaagaac cccacatttt 
7441 tctgctactc catcctgaaa gccaagaacg 
7501 ctgcggtgct tgggtcggga cagccagaga 
7561 gcagctttcc ggtgtctcct gggaggggag 
7621 gttttccccc agggatgtcg ctgggggcca 
7681 ccgtgcagtg gctgtgccac caagcattcc 
7741 acgtgccact cctggggtca ctcaggacag 



cccagggctc tgtggcggcc cccgaggagg 
tgctccgcca gcacagcagc ccctggcagg 
ggctggtgcc cccaggcctc tggggctcca 
ccaagaagtt catctccctg gggaagcatg 
agatgagcgt gcgggactgc gcttggctgc 
ccgcagagca ccgtctgcgt gaggagatcc 
tgtacgtcgt cgagctgctc aggtctttct 
acaggctctt tttctaccgg aagagtgtct 
agcacttgaa gagggtgcag ctgcgggagc 
aagccaggcc cgccctgctg acgtccagac 
ggccgattgt gaacatggac tacgtcgtgg 
ccgagcgtct cacctcgagg gtgaaggcac 
ggcgccccgg cctcctgggc gcctctgtgc 
gcaccttcgt gctgcgtgtg cgggcccagg 
tggatgtgac gggcgcgtac gacaccatcc 
gcatcatcaa accccagaac acgtactgcg 
cccatgggca cgtccgcaag gccttcaaga 
cgtacatgcg acagttcgtg gctcacctgc 
tcatcgagca gagctcctcc ctgaatgagg 
gcttcatgtg ccaccacgcc gtgcgcatca 
tcccgcaggg ctccatcctc tccacgctgc 
acaagctgtt tgcggggatt cggcgggacg 
tgttggtgac acctcacctc acccacgcga 
ctgagtatgg ctgcgtggtg aacttgcgga 
aagccctggg tggcacggct tttgttcaga 
gcctgctgct ggatacccgg accctggagg 
cctccatcag agccagtctc accttcaacc 
gcaaactctt tggggtcttg cggctgaagt 
acagcctcca gacggtgtgc accaacatct 
ttcacgcatg tgtgctgcag ctcccatttc 
tcctgcgcgt catctctgac acggcctccc 
cagccgaaga aaacatttct gtcgtgactc 
tggagccacc ccgcagaccg tcgggtgtgg 
ttgggctggg cctgtgactc ctcagcctct 
agggcgccgc cggccctctg ccctccgagg 
tgctcaagct gactcgacac cgtgtcacct 
gcaagtgtgg gtggaggcca gtgcggg 
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LOCUS pAKI28.14 7688 bp dsDNA Circular 

DEFINITION Human telomerase clone with exon alpha spliced out 
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2701 agtactcacc agtcacagaa aagcatctta 
2761 gtgctgccat aaccatgagt gataacactg 
2821 gaccgaagga gctaaccgct tttttgcaca 
2881 gttgggaacc ggagctgaat gaagccatac 
2941 tagcaatggc aacaacgttg cgcaaactat 
3001 ggcaacaatt aatagactgg atggaggcgg 
3061 cccttccggc tggctggttt attgctgata 
3121 gtatcattgc agcactgggg ccagatggta 
3181 cggggagtca ggcaactatg gatgaacgaa 
3241 tgattaagca ttggtaactg tcagaccaag 
3301 aacttcattt ttaatttaaa aggatctagg 
3361 aaatccctta acgtgagttt tcgttccact 
3421 gatcttcttg agatcctttt tttctgcgcg 
3481 cgctaccagc ggtggtttgt ttgccggatc 
3541 ctggcttcag cagagcgcag ataccaaata 
3601 accacttcaa gaactctgta gcaccgccta 
3661 tggctgctgc cagtggcgat aagtcgtgtc 
3721 cggataaggc gcagcggtcg ggctgaacgg 
3781 gaacgaccta caccgaactg agatacctac 
3841 ccgaagggag aaaggcggac aggtatccgg 
3901 cgagggagct tccaggggga aacgcctggt 
3961 tctgacttga gcgtcgattt ttgtgatgct 
4021 ccagcaacgc ggccttttta cggttcctgg 
4081 ttcctgcgtt atcccctgat tctgtggata 
4141 ccgctcgccg cagccgaacg accgagcgca 
4201 gcccaatacg caaaccgcct ctccccgcgc 
4261 ctcatgtttg acagcttatc atcgactgca 
4321 ccatcggaag ctgtggtatg gctgtgcagg 
4381 aaggcgcact cccgttctgg ataatgtttt 
4441 tattctgaaa tgagctgttg acaattaatc 
4501 cggataacaa tttcacacag gaaacagcga 
4561 tggcccgagt gctgcagagg ctgtgcgagc 
4621 tcgcgctgct ggacggggcc cgcgggggcc 
4681 gctacctgcc caacacggtg accgacgcac. 
4741 tgcgccgcgt gggcgacgac gtgctggttc 
4801 tggtggctcc cagctgcgcc taccaggtgt 
4861 ccactcaggc ccggcccccg ccacacgcta 
4921 gggcctggaa ccatagcgtc agggaggccg 
4981 cgaggaggcg cgggggcagt gccagccgaa 
5041 gcgctgcccc tgagccggag cggacgcccg 
5101 ggacgcgtgg accgagtgac cgtggtttct 
5161 aagccacctc tttggagggt gcgctctctg 
5221 gccagcacca cgcgggcccc ccatccacat 
5281 gtcccccggt gtacgccgag accaagcact 
5341 tgcggccctc cttcctactc agctctctga 
5401 tggagaccat ctttctgggt tccaggccct 
5461 gcctgcccca gcgctactgg caaatgcggc 
5521 cgcagtgccc ctacggggtg ctcctcaaga 



cggatggcat gacagtaaga gaattatgca 
cggccaactt acttctgaca acgatcggag 
acatggggga tcatgtaact cgccttgatc 
caaacgacga gcgtgacacc acgatgcctg 
taactggcga actacttact ctagcttccc 
ataaagttgc aggaccactt ctgcgctcgg 
aatctggagc cggtgagcgt gggtctcgcg 
agccctcccg tatcgtagtt atctacacga 
atagacagat cgctgagata ggtgcctcac 
tttactcata tatactttag attgatttaa 
tgaagatcct ttttgataat ctcatgacca 
gagcgtcaga ccccgtagaa aagatcaaag 
taatctgctg cttgcaaaca aaaaaaccac 
aagagctacc aactcttttt ccgaaggtaa 
ctgtccttct agtgtagccg tagttaggcc 
catacctcgc tctgctaatc ctgttaccag 
ttaccgggtt ggactcaaga cgatagttac 
ggggttcgtg cacacagccc agcttggagc 
agcgtgagca ttgagaaagc gccacgcttc 
taagcggcag ggtcggaaca ggagagcgca 
atctttatag tcctgtcggg tttcgccacc 
cgtcaggggg gcggagccta tggaaaaacg 
ccttttgctg gccttttgct cacatgttct 
accgtattac cgcctttgag tgagctgata 
gcgagtcagt gagcgaggaa gcggaagagc 
gttggccgat tcattaatgc agaattaatt 
cggtgcacca atgcttctgg cgtcaggcag 
tcgtaaatca ctgcataatt cgtgtcgctc 
ttgcgccgac atcataacgg ttctggcaaa 
atcggctcgt ataatgtgtg gaattgtgag 
tgaattcaga tctcaccatg aaggagctgg 
gcggcgcgaa gaacgtgctg gccttcggct 
cccccgaggc cttcaccacc agcgtgcgca 
tgcgggggag cggggcgtgg gggctgctgc 
acctgctggc acgctgcgcg ctctttgtgc 
gcgggccgcc gctgtaccag ctcggcgctg 
gtggaccccg aaggcgtctg ggatgcgaac 
gggtccccct gggcctgcca gccccgggtg 
gtctgccgtt gcccaagagg cccaggcgtg 
ttgggcaggg gtcctgggcc cacccgggca 
Qtgtggtgtc acctgccaga cccgccgaag 
gcacgcgcca ctcccaccca tccgtgggcc 
cgcggccacc acgtccctgg gacacgcctt 
tcctctactc ctcaggcgac aaggagcagc 
ggcccagcct gactggcgct cggaggctcg 
ggatgccagg gactccccgc aggttgcccc 
ccctgtttct ggagctgctt gggaaccacg 
cgcactgccc gctgcgagct gcggtcaccc 

Fig. 15C 



5581 cagcagccgg tgtctgtgcc cgggagaagc 

5641 aggacacaga cccccgtcgc ctggtgcagc 

5701 tgtacggctt cgtgcgggcc tgcctgcgcc 

5761 ggcacaacga acgccgcttc ctcaggaaca 

5821 ccaagctctc gctgcaggag ctgacgtgga 

5881 gcaggagccc aggggttggc tgtgttccgg 

5941 tggccaagtt cctgcactgg ctgatgagtg 

6001 tttatgtcac ggagaccacg tttcaaaaga 

6061 ggagcaagtt gcaaagcatt ggaatcagac 

6121 tgtcggaagc agaggtcagg cagcatcggg 

6181 tccgcttcat ccccaagcct gacgggctgc 

6241 gagccagaac gttccgcaga gaaaagaggg 

6301 tgttcagcgt gctcaactac gagcgggcgc 

6361 tgggcctgga cgatatccac agggcctggc 

6421 acccgccgcc tgagctgtac tttgtcaagg 

6481 tcaaacccag aacacgtact gcgtgcgtcg 

6541 gcacgtccgc aaggccttca agagccacgt 

6601 gcgacagttc gtggctcacc tgcaggagac 

6661 gcagagctcc tccctgaatg aggccagcag 

6721 gtgccaccac gccgtgcgca tcaggggcaa 

6781 gggctccatc ctctccacgc tgctctgcag 

6841 gtttgcgggg attcggcggg acgggctgct 

6901 gacacctcac ctcacccacg cgaaaacctt 

6961 gtatggctgc gtggtgaact tgcggaagac 

7021 cctgggtggc acggcttttg ttcagatgcc 

7081 gctgctggat acccggaccc tggaggtgca 

7141 catcagagcc agtctcacct tcaaccgcgg 

7201 actctttggg gtcttgcggc tgaagtgtca 

7261 cctccagacg gtgtgcacca acatctacaa 

7321 cgcatgtgtg ctgcagctcc catttcatca 

7381 gcgcgtcatc tctgacacgg cctccctctg 

7441 gatgtcgctg ggggccaagg gcgccgccgg 

7501 gtgccaccaa gcattcctgc tcaagctgac 

7561 ggggtcactc aggacagccc agacgcagct 

7621 tgccctggag gccgcagcca acccggcact 

7681 atctagag 



cccagggctc tgtggcggcc cccgaggagg 
tgctccgcca gcacagcagc ccctggcagg 
ggctggtgcc cccaggcctc tggggctcca 
ccaagaagtt catctccctg gggaagcatg 
agatgagcgt gcgggactgc gcttggctgc 
ccgcagagca ccgtctgcgt gaggagatcc 
tgtacgtcgt cgagctgctc aggtctttct 
acaggctctt tttctaccgg aagagtgtct 
agcacttgaa gagggtgcag ctgcgggagc 
aagccaggcc cgccctgctg acgtccagac 
ggccgattgt gaacatggac tacgtcgtgg 
ccgagcgtct cacctcgagg gtgaaggcac 
ggcgccccgg cctcctgggc gcctctgtgc 
gcaccttcgt gctgcgtgtg cgggcccagg 
acaggctcac ggaggtcatc gccagcatca 
gtatgccgtg gtccagaagg ccgcccatgg 
ctctaccttg acagacctcc agccgtacat 
cagcccgctg agggatgccg tcgtcatcga 
tggcctcttc gacgtcttcc tacgcttcat 
gtcctacgtc cagtgccagg ggatcccgca 
cctgtgctac ggcgacatgg agaacaagct 
cctgcgtttg gtggatgatt tcttgttggt 
cctcaggacc ctggtccgag gtgtccctga 
agtggtgaac ttccctgtag aagacgaggc 
ggcccacggc ctattcccct ggtgcggcct 
gagcgactac tccagctatg cccggacctc 
cttcaaggct gggaggaaca tgcgtcgcaa 
cagcctgttt ctggatttgc aggtgaacag 
gatcctcctg ctgcaggcgt acaggtttca 
gcaagtttgg aagaacccca catttttcct 
ctactccatc ctgaaagcca agaacgcagg 
ccctctgccc tccgaggccg tgcagtggct 
tcgacaccgt gtcacctacg tgccactcct 
gagtcggaag ctcccgggga cgacgctgac 
gccctcagac ttcaagacca tcctggactg 



Fig. 15D 




Fig. 9 



sequence "Y" 104-105 bases 

6GCCTCCCCG6GGTCGGCGTCCGGCTGGGGTT6AGGGCGGCCGGGGGGAACCAG 

GlyLeuProGlyValGlyValArgLeuGlyLeuArgAl aAl aGlyGlyAsnGl n 
Al aSerProGlySerAi aSerG1yTrpGl# * GlyArgProGlyGlyThrSer 

ProProArgGlyArgArgProAl aGlyValGl uGlyGlyArgGlyGluProAl a 

CGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
ArgHisAl aGl uSerSerAl aGlyAspSerGlyArgPheProArgArg 
AspMetArgArgAlaAlaGlnAl aThrGlnGlyAl aSerProAl aGly 
ThrCysGlyGl uGl nArgArgArgLeuArgAl aLeuProProGl nVal 

sequence "1" 38 bases 
GTGGCTGTGCTTTGGTTTAACTTCCTTTTTAACCAGAA 
Val AlaVal LeuTrpPheAsnPheLeuPheAsnGl nLys 



sequence "a" 36 bases 

GTGGATGTGACGGGCGCGTACGACACCATCCCCCAG 

ValAspValThrGlyAlaTyrAspThrlleProGln 



sequence "p" 182 bases 

GTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 
ValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAl aHi sLeu 

CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTG 
GlnGluThrSerProLeuArgAspAlaValVal II eGl uGl nSerSerSerLeu 

AATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCAC 
AsnGl uAl aSerSerGlyLeuPheAspVal PheLeuArgPheMetCysHi sHis 

GCCGTGCGCATCAGGGGCAA 
AlaValArglleArgGlyLys 



partial sequence "2" unknown length 

GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAG 
Ter 

GGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGT 
CAGATGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTG 
TGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA... 



Fig. 10 A 



sequence "3" 159 bases 

CCGAAGAAAACATTTCTGTCGTGACTCCTGCGGTGCTTGGGTCGGGACAGCCAGA6 
AlaGluGluAsnlleSerValValThrProAlaValLeuGlySerGlyGlnProGlu 

ATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGG 
MetGluProProArgArgProSerGlyValGlySerPheProValSerProGlyArg 

GGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
GlyValGlyLeuGlyLeu * 



sequence "X" unknown length 

...GACAGTCACCAGGGGGGTTGACCGCCGGACTGGGCGTCCCCAGGGTTGACTATAGGA 
CCAGGTGTCCAGGTGCCCTGCAAGTAGAGGGGCTCTCAGAGGCGTCTGGCTGGCATGG 
GTGGACGTGGCCCCGGGCATGGCCTTCTGCGTGTGCTGCCGTGGGTGCCCTGAGCCCT 
CACTGAGTCGGTGGGGGCTTGTGGCTTCCCGTGAGCTTCCCCCTAGTCTGTTGTCTGG 
CTGAGCAAGCCTCCTGAGGGGCTCTCTATTG 



partial sequence of genomic intron (approximately 2.7 kb) 
GTGGCTGTGCTTTGGTTTAACTTCCTTTTTAACCAGAAGTGCGTTTGAGCCCCACATT 
TGGTATCAGCTTAGATGAAGGGCCCGGAGGAGGGGCCACGGGACACAGCCAGGGCCAT 
GGCACGGCGCCCACCCATTTGTGCGCACAGTGAGGTGGCCGAGGTGCCGGTGCCTCCA 
GAAAAGCAGCGTGGGGGTGTAGGGGGAGCTCCTGGGGCAGGGAC .... 



Fig. JOB 



Truncated telomerase 

ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGA6GTGCTGCC6CTGGCCACGTTCGTG 
MPRAPRCRAVRSILRSHYREVIPIATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q 6 H R L V Q R G 0 P A A F R A L V A Q C I V C V P H D A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P $ F R Q V 5 C I K E L V A R V I Q R L C E R G A K N V L A f G f A L L 0 G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y I P N T V T D A L R G S G A H G L L L R R V G 0 0 V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q I G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A H N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTG6CACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P H 0 T P C P P V Y A E T K H F LY S S G 0 K E Q L R P S f L L S 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPHMPGTPRRLPRLPQRYHQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELIGNHAQCPYGVLIKTHCPLRAAVTPAAGVCAR 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTOPRRLVQLLRQHSSPWQVYGfVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHHERRFLRNTKKFISLGKHAKLSLQEl 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
I W K H S V R 0 C A W L R R S P G V G C V P A A E H R I R E E I L A K F L H N L 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
HSVYVVELLRSFFYV1ETTFQKNRLFFYRKSVHSKLQSIG 

AAT- -NNN- -GACAGTCACCAGGGGGGTTGACCGCCGGACTGGGCGTCCCCAGGGTTGACTATAGGACCAGGTGTCCAGGTGCCCTGCAAGTAGAGGGGCTCTCAGAGGCGTCTGGCTGG 




Fig. 11 A 



TG6ACGTGGCCCCGGGCATGGCCTTCTGCGTGTGCTGCCGTGG6TGCCCTGAGCCCTCACTGAGTCGGTGGGGGCTTGT66CTTCCCGTGAGCTTCCCCCTA6TCTGTTGTCTG 
GCTGAGCAAGCCTCCTGAGGGGCTCTCTATTG. 



Fig. 11B 



Truncated protein 1 

ATGCCGC6C6CTCCCCGCTGCCGAGCCGTGCGCTCCCTGCT6CGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTC6TG 
MPRAPRCRAVRSILRSHYREVLPIATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCG6CTTTCCGC6CGCTGGT6GCCCAGTGCCTGGTGTGC6TGCCCTGGGACGCACG6CCGCCCCCCGCCGC 
R R L G.P Q G V R L V Q R G D P A A F R A L V A Q C L V C V P H 0 A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGG6CCCG 
PSFRQVSCIKELVARVLQRLCERGAKNVLAFGFALLDGAR 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V I D A L R G S G A W G L L I R R V G D 0 V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
I V H I I A R C A t F V I V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R I G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGA6CCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRIRGPSDR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G I R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFllS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSITGARRLVETIFLGSRPHHPGTPRRLPRLPQRYDQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFIELIGNHAQCPYGVIIKTHCPIRAAVTPAAGVCAR 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRIVQIIRQHSSPWQVYGFVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
I R R L V P P G I W G S R H N E R R F I R N T K K f 1 $ L G K H A K I S L Q E L 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKHSVRDCAWIRRSPGVGCVPAAEHRIREEILAKFLHHL 

GATGAGT6TGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
HSVYVVELLRSFFYVIETTFQKNRLFFYRKSVWSKLQSIG 




Fig. 11C 



AATCAGACA6CACTTGAA6AGGGT6CAGCTGCGG6AGCTGTCGGAAGCAGA6GTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVOLRELSEAEVRQHREARPALLTSRLRFIPKPO 



GTGGCTGTKTTTGGTTTUCTTCCTTTTTAACCAGAA 
VAVLWFTFIFNQK 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCA6AACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTC6AGGGTGAA6GCACTGTTCAGCGTGCTCAACTACGA 
GIRP1VNMOYVVGARTFRREKR PSVSFRG* 



Fig. I ID 



Truncated protein 2 



ATGCCGC6CGCTCCCC6CTGCC6A6CCGTGC6CTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCC6CTGGCCACGTTCGTG 
H P R A P R C R A V R S L L R S H Y R E V L P I A T F V 

CG6CGCCTGGGGCCCCAGG6CT6GCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGG6ACGCACG6CCGCCCCCCGCCGC 
R R L G P 0 G H R L V Q R G 0 P A A F R A L V A Q C I V C V P H 0 A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCIKELVARVLQRLCERGAKNVLAFGFAIIDGAR 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A I R G S G A W G L L' L R R V G D D V 

GCTGGTTCACCTGCTGGCACGCTGC6CGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A H N H S V R E A G V P L 6 L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSOR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A t S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGA6CAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P H 0 T P C P P V Y A E T K H F I Y S S G D K E Q L R P S F I L S 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWHPGTPRRLPRIPQRYHQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVILKTHCPLRAAVTPAAGVCAR 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEOTDPRRLVQLLRQHSSPWQVYGFVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
IRRIVPPGLWGSRHNERRFLRNTKKFISIGKHAKISLQEL 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
T H K H S V R D C A W I R R S P G V G C V P A A E H R L R E E I L A K F L H H L 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELIRSFFYVTETTFQKNRIFFYRKSVWSKLQSIG 



Fig. HE 



8Y 

ATCAGACAGCACTT6AA6AGGGTGCA6CTGCGGGA6CTGTCGGAAGCAGAG6TCAGGCAGCATCGGGAAGCCAGGCCC6CCCT6CTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
I RQHLKRVQLRELSEAEVRQHREARPALLTSRLRF I PKPD 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYE 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G I 0 D I H R A W R T F V I R V R A Q D P P P E I Y F 

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
V K V 0 V T G A Y 0 T 1 P Q 0 R L I E V I A S 1 I K P-Q N T Y C V R R Y A V V Q 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCAC 
KAAHGHVRKAFKSH 

GTCCTACGTCCAGTG 
V L R P V 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
P G D P A G L H P L H A A L Q P V L R R H G E Q A V C G 0 S A G R A A P A F G G 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 



Fig. 11F 




Reference protein 



ATGCCGC6CGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCT6C6CAGCCACTACC6CGAG 60 

MetProArgAl aProArgCysArgAl aVal ArgSerLeuLeuArgSerHi sTyrArgGl u 20 

GTGCTGCC6CTG6CCACGTTCGTGCG6CGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAG 120 

Val LeuProLeuAl aThrPheVal ArgArgLeuGly ProGI nGl yTrpArgLeuVa 1 Gl n 40 

CGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGG 180 

ArgGlyAspProAl aAl aPheArgAl aLeuVal Al aGI nCysLeuVal CysVal ProTrp 60 

GACGCACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTG 240 

AspAlaArgProProProAlaAlaProSerPheArgGlnValSerCysLeuLysGluLeu 80 

GTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGC 300 

Val Al aArgVal LeuGl nArgLeuCysGl uArgGlyAl aLysAsnVal LeuAlaPheGly 100 

TTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGC 360 

PheAl aLeuLeuAspGlyAl aArgGlyGlyProProGl uAl aPheThrThrSerVal Arg 120 

AGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTG 420 

SerTyrLeuProAsnThrValThrAspAlaLeuArgGlySerGlyAlaTrpGlyLeuLeu 140 

TTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTG 480 

LeuArgArgVal GlyAspAspVal LeuVal Hi sLeuLeuAl aArgCysAl aLeuPheVal 160 

CTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCT 540 

LeuVal Al aProSerCysAl aTyrGl nVal CysGlyProProLeuTyrGl nLeuGlyAl a 180 

GCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAA 600 

AlaThrGlnAlaArgProProProHisAlaSerGlyProArgArgArgLeuGlyCysGlu 200 

CGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGT 660 

ArgAlaTrpAsnHisSerVal ArgGl uAlaGlyVal ProLeuGlyLeuProAlaProGly 220 

GCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGT 720 

AlaArgArgArgGlyGlySerAlaSerArgSerLeuProLeuProLysArgProArgArg 240 

GGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGC 780 

GlyAlaAlaProGluProGluArgThrProValGlyGlnGlySerTrpAlaHisProGly 260 

AGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAA 840 

ArgThrArgGlyProSerAspArgGlyPheCysVal Val SerProAl aArgProAl aGl u 280 

GAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGC 900 

GluAl aThrSerLeuGl uGlyAl aLeuSerGl yThrArgHi sSerHi sProSerVal Gly 300 
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C6CCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCCTGGGACACGCCT 
ArgGlnHisHisAlaGlyProProSerThrSerArgProProArgProTrpAspThrPro 




TGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAG 1020 

CysProProValTyrAl aGl uThrLysHi sPhelauTyrSerSarGlyAspLysGl uGl n 340 

CTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTC 1080 

LeuArgProSerPheLeuLeuSerSerLeuArgProSerLeuThrGlyAl aArgArgLeu 360 

GTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCC 1140 

ValGluThrllePheLeuGlySerArgProTrpMetProGlyThrProArgArgLeuPro 380 

CGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCAC 1200 

ArgLeuProGlnArgTyrTrpGlnMetArgProLeuPheLeuGluLeuLeuGlyAsnHis 400 

GCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCG.CTGCGAGCTGCGGTCACC 1260 

AlaGlnCysProTyrGlyValLeuLeuLysThrHisCysProLeuArgAlaAlaValThr 420 

CCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAG 1320 

ProAl aAl aGlyVal CysAl aArgGl uLysProGl nGlySerVal Al aAl aProGl uGl u 440 

GAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAG 1380 

GluAspThrAspProArgArgLeuValGl nLeuLeuArgGl nHisSerSerProTrpGln 460 

GTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCC 1440 

ValTyrGlyPheValArgAlaCysLeuArgArgLeuValProProGlyLeuTrpGlySer 480 

AGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCAT 1500 

ArgHisAsnGluArgArgPheLeuArgAsnThrLysLysPhell eSerLeuGlyLysHi s 500 

GCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGGCTGCGCTTGGCTG 1560 

Al aLysLeuSerLeuGl nGl uLeuThrTrpLysMetSerVal ArgAspCysAl aTrpLeu 520 

CGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATC 1620 

ArgArgSerProGlyValGlyCysVal ProAl aAl aGl uHi sArgLeuArgGl uGluIl e 540 

CTGGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTC 1680 

LeuAlaLysPheLeuHisTrpLeuMetSerValTyrVal Val Gl uLeuLeuArgSerPhe 560 

TTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTC 1740 

PheTyrVal ThrGl uThrThrPheGl nLysAsnArgLeuPhePheTy rArgLysSerVa 1 580 

TGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAG 1800 

TrpSerLysLeuGlnSerlleGlylleArgGlnHisLeuLysArgValGlnLeuArgGlu 600 

CTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGA 1860 

LeuSerGl uAl aGl uVal ArgGl nHi sArgGl uAl aArgProAl aLeuLeuThrSerArg 620 
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CTCCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTG 1920 

LeuArgPhelleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValVal 640 

GGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCA 1980 

GlyAlaArgThrPheArgArgGlulysArgAlaGluArgleuThrSerArgValLysAla 660 

CTGTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTG 2040 

LeuPheSerVal LeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerVal 680 

CTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAG 2100 

LeuGly LeuAspAspIl eHi sArgAl aTrpArgThrPheVal LeuArgVal ArgAl aGl n 700 

GACCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATC. 2160 

AspProProProGl uLeuTy rPheVa 1 LysVal AspVal ThrGlyAl aTyrAspThrll e 720 

CCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGC 2220 

ProGlnAspArgLeuThrGl uVal II eAlaSerll ell eLysProGl nAsnThrTyrCys 740 

GTGCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAG 2280 

Val ArgArgTyrAl aVal Val Gl nLysAlaAl aHi sGlyHi sVal ArgLysAl aPheLys 760 

AGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 2340 

SerHisValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAlaHisLeu 780 

CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAG 2400 

Gl nGluThrSerProLeuArgAspAl aVal Val II eGl uGl nSerSerSerLeuAsnGl u 800 

GCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 2460 

AlaSerSerGlyLeuPheAspValPheLeuArgPheMetCysHisHisAlaValArglle 820 

AGGGGCAAGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTG 2520 

ArgGlyLysSerTyrVal Gl nCysGl nGly II eProGlnGlySerll eLeuSerThrLeu 840 

CTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGAC 2580 

LeuCysSerLeuCysTyrGlyAspMetGluAsnLysLeuPheAlaGlylleArgArgAsp 860 

GGGCTGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCG 2640 

GlyLeuLeuLeuArgLeuValAspAspPheLeuLeuValThrProHisLeuThrHisAla 880 

AAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTG 2700 

LysThrPheLeuArgThrLeuValArgGlyValProGluTyrGlyCysValValAsnLeu 900 

CGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTT 2760 

ArgLysThrVal Val AsnPheProVal Gl uAspGl uAl aLeuGlyGlyThrAl aPheVal 920 

CAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTG 2820 

GlnMetProAlaHisGlyLeuPheProTrpCysGlyLeuLeuLeuAspThrArgThrLeu 940 
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GAGGTGCAGAGC6ACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTC 2880 

Gl uVal Gl nSerAspTy rSerSerTyrAI aArgThrSerll eArgAl aSerLeuThrPhe 960 

AACCGCGGCTTCAA6GCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTG 2940 

AsnArgGlyPheLysAl aGlyArgAsnMetArgArgLysLeuPheGlyVal LeuArgLeu 980 

AAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAAC 3000 

LysCysHisSerLeuPheLeuAspLeuGlnValAsnSerLeuGlnThrValCysThrAsn 1000 

ATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCA 3060 

IT eTyrLysIl eLeuLeuLeuGI nAl aTyrArgPheHi sAl aCysVal LeuGl nLeuPro 1020 

TTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCC 3120 

PheHi sGlnGl nValTrpLysAsnProThrPhePheLeuArgVal 1 1 eSerAspThrAl s 1040 

TCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGC 3180 

SerLeuCysTyrSerlleLeuLysAlaLysAsnAlaGlyMetSerLeuGlyAlaLysGly 1060 

GCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTC 3240 

AlaAlaGlyProLeuProSerGluAlaValGlnTrpLeuCysHisGlnAlaPheLeuLeu 1080 

AAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAG 3300 

LysLeuThrArgHisArgValThrTyrVal ProLeuLeuGlySerLeuArgThrAlaGln 1100 

ACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAAC 3360 

ThrGlnLeuSerArgLysLeuProGlyThrThrLeuThrAlaLeuGluAlaAlaAlaAsn 1120 

CCGGCACTGCCCTCAGACTTCAAGACCATCCTGGAC 3420 

ProAl aLeuProSerAspPheLysThrll eLeuAsp 1132 
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Truncated 



protein 3 



ATGCCGCGCGCTCCCC6CT6CCCAGCCGT6CGCTCCCTGCT6CGCAGCCACTACC6CGAGGTGCT6CCGCTGGCCACGTTCGTG 
MPRAPRCRAVRSIIRSHYREVLPIAIFV 

C6GCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G 0 P A A F R A L V A Q C L V C V P H 0 A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRICERGAKNVLAFGFALLOGAR 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N I V I 0 A L R G S G A W G L L L R R V G 0 0 V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
I V H I L A R C A L F V L V A P S C A Y 0 V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R I G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSIPLPKRPRRGAAPEPERIPVGQGSNAHPGRTRGPSDR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G I R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S1SRPPRPH0TPCPPVYAETKHFLYSSGDKEQLRPSFLLS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSITGARRLVET1FLGSRPWHPGIPRRLPRLPQRYWQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L I G N H A Q C P Y G V L I K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEOTOPRRLVQLLRQHSSPWQVYGFVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
I R R I V P P G I W G S R H N E R R f L R N T K K F I $ L G K H A K L S I Q E I 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TUKHSVROCAHLRRSPGVGCVPAAEHRLREEILAKFLHHI 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
HSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG 
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AATCAGACA6CACTT6AA6AGGGTGCAGCTGC6GGAGCTGTCGGAAGCA6AG6TCAGGCA6CATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCC6CTTCATCCCCAAGCCTGA 
I RQH L KRVQL RELS E AEV RQHREARPAL L T S R L RF I P KP 0 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G I R P I V N M D Y V V G A R T F R R E K R A E R L T $ R V K A L f S V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L 0 D I H R A W R I F V L R V R A Q 0 P P P E L Y F 

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
V K V D V T G A Y 0 I I P Q 0 R L I E V I A S 1 1 K P Q H T Y C V R R Y A V V Q 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHIQETSPLRD 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E 0 S S S L N E A S S G L F 0 V F L R f M C H H A V R I R G K S Y V Q C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G S 1 L S T L L C S L C Y G 0 H E N K L F A G I R R D G I L L R L V 0 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
0 F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 
VEDEALGGTAFVQMPAHGlfPWCGLLLDTRTLEVQSOYSR 



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 



TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA — 
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«t\ Altered C-terminus protein 

2 6 2004 p) 

ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCT6CTGCGCAGCCACTACCGCGAGGTGCTGCCGCTG6CCACGTTCGTG 
MPRAPRCRAVRSLLRSHYREVIPLATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G V R L V Q R G 0 P A A F R A L V A Q C I V C V P H 0 A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V L Q R L C E R G A K N V L A F G f A L L 0 G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T I S ¥ R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H I L A R C A L F V L V A P S C A Y Q V C G P P L Y Q I G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGIGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A I S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTAC7CAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPHHPGIPRRLPRIPQRYWQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
H R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V I P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGIGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLHGSRHNERRFLRNTKKFISLGKHAKLSIQEL 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKHSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHHL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
HSVYVVEILRSFFYVTETTFQKNRLFFYRKSVHSKLQSIG 
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ATCAGACAGCACTTGAAGAG6GT6CAGCTGCGGGAGCTGTCG6AAGCAGAG6TCAGGCAGCATCG6GAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IROHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N H 0 Y V V G A R T f R R E K R A E R I T S R V R A L F S V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L 0 0 1 H R A W R T F V L R V R A Q D P P P E I Y F 

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
M V 0 V T G A Y 0 T I P Q 0 R L T E V i A S 1 1 K P Q N T Y C V R R Y A V V Q 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTOLQPYHRQFVAHLQETSPLRD 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F 0 V f L R F H C H H A V R I R G K S Y V Q C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P 0 G S I L S T L L C S L C Y G 0 N E N K L F A G I R R 0 G L I L R L V 0 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
D F L L V T P H I T H A K T F L R T L V R G V P E Y G C V V N I R K T V V N F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 
VEDEAlGGTAFVQMPAHGLFPWCGlllDTRTLEVQSDYSS 

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
Y A R I S I R A 5 L T F N R G F K A G R N H R R K L F G V L R L K C H S L F L 0 

ITTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
IQVNSLQTVCTNIYKILLLQAYRFHACVIQIPFHQQVWKN 

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
PTFFLRVISDTASLCYSIIKAKNAE 



E E N I L V V T P A V L G S 

GGGACAGCCAGAGATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
GQPENEPPRRPSGVGSFPVSPGRGVGLGL* 



Fig. UN 



Protein that lacks motif A 



ATGCCGCGCGCTCCCCGCTGCC6A6CCGTGC6CTCCCTGCT6C6CAGCCACTACCGC6AG6T6CT6CCGCT6GCCACGTTCGT6 
H P R A P R C R A V R S L L R S H y R E V L P L A I F V 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G K R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V L Q R L C E R G A K N V L A F G F A L L 0 G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGIGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGI 
G G P P E A F T I S V R S y L P N T V T 0 A L R G S G A H G L L L R R V G 0 D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A T Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A H N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGICCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSNAHPGRTRGPSOR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W 0 T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L I S 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SIRPSLTGARRIVETIFIGSRPWMPGTPRRIPRIPQRYWQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
H R P L F L E L L G N H A Q C P Y G V I L K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGIGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E 0 T 0 P R R L V 0 L L R Q H S S P H Q V Y G F V R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKISLQEL 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
1HKMSVR0CAWLRRSPGVGCVPAAEHRLREEILAKFLHHL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
M S V Y V V E L L R S f F Y V T E T T F Q K N R L F F Y R K S V W S K L 0 S I G 




Fig. no 



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAG6CAGCATCGGGAAGCCAGGCCC6CCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRE LSEAEVRQHREARPALLTSRLRF I PKPO 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GIRPIVNMOYVVGARTFRREKRAERLTSRVKALFSVINYE 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G I I G A S ¥ L G I D 0 I H R A H R T f V I R V R A 0 D P P P E L Y F 

IGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K ORIIEVIASIIKPQNTYCVRRYAVVQ 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTITDLQPYMRQFVAHIQETSPLRD 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G I F 0 V F L R f H C H H A V R 1 R G K S Y V Q C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QG1PQGSILSTILCSLCYG0HENKLFAGIRRDGLLLRLVD 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
OFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFP 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E 0 E A L G G T A F V Q N P A H G L F P W C G L L L D T R I L E V Q S D Y S S 

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 

V A R T S I R A S L T F N R G F K A G R N H R R K L F G V L R L K C H S L F I 0 

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
L Q V N S L Q T V C T N I Y K I L L L Q A Y R F H A C V L Q L P F H Q Q V W K N 

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
P T F F L R V 1 S D T A S L C Y S I L K A K N A G H S L G A K G A A G P L P S E 

GGCCGTGCAGTGGC1GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCC 
AVQHLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP 

GGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCC 
G T T L T A L E A A A N P A L P S 0 F K T I L D 

CTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCT 
GAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCAC 
CAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAA 



Fig. IIP 



G6ACCCTGGGAGCTCT6GGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAG6CGA6GACCCTGCACCTGGATGG66GTCCCTGTGGGTCAAATTGGGG6GAGGTGCTGTGGGAGTAA 
AATACTGAATATATGAGTTTTTCAGTTTTGA 



Fig. iiq 




Truncated protein that lacks motif A 



ATGCC6CGCGCTCCCC6CTGCCGAGCCGTGCGCTCCCT6CT6C6CAGCCACTACCGC6AGGTGCTGCCGCTGGCCACGTTCGTG 
M P R A P R C R A V R S L I R S H Y R E V L P L A T F V 



CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G 0 P A A F R A L V A Q C I V C V P H 0 A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V L Q R L C E R G A K N V L A F G F A I L D G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A H G L L I R R V G 0 0 V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L ¥ A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S I P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
GFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPP 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
SISRPPRPWOTPCPPVYAETKHFIYSSGOKEQLRPSFLLS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACICCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R P S L T G A R R L V E T I F L G S R P W H P G T P R R L P R L P Q R Y H Q 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
H R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTOPRRLVQILRQHSSPWQVYGFVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLHGSRHNERRFIRNTKKFISLGKHAKLSIQEL 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHHl 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVHSKLQSIG 



Fig. 11 R 



AATCAGACA6CACTTGAA6AGGGTGCA6CTGCGGGAGCTGTCGGAAGCAGA6GTCAGGCAGCATC66GAAGCCA66CCCGCCCTGCTGAC6TCCAGACTCC6CTTCATCCCCAAGCCTGA 
IRQHLKRVOLRELSEAEVRQHREARPALLTSRLRF I PKPD 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P i V N H 0 Y V V G A R T F R R [ K R A E R L T S R V K A L F S V L N Y E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V I R V R A Q 0 P P P E L Y F 

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K DRLTEVIASI IKPQNTYCVRRYAVVQ 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLT.DIQPYHRQFVAHLQETSPIRD 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S I N E A S S G I F D V F L R F H C H H A V R I R G K S Y V Q C 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QGIPQGSILSTLLCSLCYGOHENKLFAGiRROGLLLRLVD 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
DFLLVTPHITHAKTFLRTIVRGVPEYGCVVNLRKTVVNFP 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q M P A H G L F P W C G L L L 0 T R T L E V Q S 0 Y S R 



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 



TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA 



Fig. 11S 




Lacks motif A and altered C-termi nus 



ATGCCGCGCGCTCCCCGCT6CCGAGCCGTGCGCTCCCTGCT6CGCAGCCACTACC6CGA6GTGCTGCCGCTGGCCACGTTCGT6 
H P R A P R C R A V R S L L R S H Y R E V L P L A T F V 



CGGCGCCTGGGGCCCCAGGGCTG6CGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R I G P Q G H R L V Q R G 0 P A A F R A L V A Q C L V C V P H D A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKEIVARVLQRLCERGAKNVLAFGFAILOGAR 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T $ V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F ¥ L V A P S C A Y Q V C G P P L Y Q L G A A T 0 A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSHAHPGRTRGPSOR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
GFCVVSPARPAEEATSIEGALSGTRHSHPSVGRQHHAGPP 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPHOTPCPPVYAETKHFLYSSGDKEQLRPSFLLS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFIGSRPHMPGIPRRLPRLPQRYWQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E 0 T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
L R R I V P P G I W G $ R H N E R R F L R N T K K F 1 S I G K H..A K L $ I Q E I 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
THKHSVROCAHLRRSPGVGCVPAAEHRLREE I LAKFIHWL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
HSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG 



Fig. 11T 



ATCAGACAGCACTTGAAGAGGGT6CAGCT6CG6GAGCTGTC6GAAGCAGAGGTCAGGCAGCATCGG6AAGCCAGGCCC6CCCTGCTGACGTCCA6ACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRE LSEAEVRQHREARPALLTSRLRF I PKPD 

CGGGCT6CGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCC6CAGA6AAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N M 0 y V V G A R T F R R E K R A E R L T S R V K A I F S V L N T E 

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G I L G A S V L G I D D I H R A H R T F V L R V R A Q D P P P E L T f 

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K 0RLTEV1ASI IKPQNTYCVRRYAVVQ 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
K A A H G H V R K A F K S H V S T L T D L Q P Y H R 0 F V A H L Q E T S P L R 0 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
AVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQC 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G S 1 L S T L L C S L C Y G D M E N K L F A G I R R 0 G L L I R I V 0 

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
0 F L I V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E 0 E A L G G T A F V Q N P A H G L F P W C G L L L 0 T R T L E V Q S D Y S S . 

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 

V A R T S I R A S L T F N R G F K A G R N H R R K L F G V L R L K C H S L F I 0 

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
L Q V N S L Q T V C T N i Y K I L I L Q A Y R F H A C V L Q L P F H Q Q V H K N 

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCICTGCCCTCCGA 
PTFFLRV1SDTASLCYSILKAKNAE 

CCGAAGAAAACATTTCTGTCGTGACTCCTGCGGTGCTTGGGTC 
E E N I L V V T P A V L G S 

GGGACAGCCAGAGATGGAGGCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTT7CCCCCAG 
GQPEMEPPRRPSGVGSFPVSPGRGVGLGL* 



Fig. 11 U 



Truncated telomerase (ver. 2) 




ATGCC6CGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACC6CGAG6TGCTGCCGCTGGCCACGTTCGTG 
H P R A P R C R A V R S L I R S H y R E V L P L A T F V 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G V R L ¥ Q R G D P A A F R A L V A Q C L V C V P H 0 A R P P P A A 

GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G I P G V G V R L G L R A A G G N 0 R H A E S S A G 0 S G R F P R R 
A S P G S A S G W G * G R P G G T S D H R R A A Q A T Q G A S P A G 
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVIQRLCERGAKNVLAFGFALLOGAR 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N I V T D A L R G S G A H G L I L R R V G 0 D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
I V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P I G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSOR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A I S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPKDTPCPPVYAETKHFLYSSGOKEQLRPSFLLS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCA6GGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R P S L T G A R R L V E T I F L G S R P H H P G T P R R L P R I P Q R Y W 0 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
H R P L F L E I L G N H A 0 C P Y G V L L K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDIDPRRIVQLLRQHSSPWQVYGFVRAC 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
L R « I V P P G L H G S R H N E R R F I R N T i K F 1 S I G K H A K L S I Q E L 



Fig. 11V 



GAC6TGGAAGATGAGCGTGCG6GACT6CGCTTGGCTGCGCAGGAGCCCAGGGGTTG6CTGTGTTCCGGCCGCAGAGCACCGTCTGC6TGAG6AGATCCTGGCCAAGTTCCTGCACTGGCT 
THKHSVR0CAWLRRSP6VGCVPAAEHRLREE I LAKF LHWL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELIRSFFYVTETIFQKNRLFFYRKSVHSKIQSIG 

AAT-NNN -GACAGTCACCAGGGGGGTTGACCGCCGGACTGGGCGTCCCCAGGGTTGACTATAGGACCAGGTGTCCAGGTGCCCTGCAAGTAGAGGGGCTCTCAGAGGCGTCTGGCTGG 

CATGGGTGGACGTGGCCCCGGGCATGGCCTTCTGCGTGTGCTGCCGTGGGTGCCCTGAGCCCTCACTGAGTCGGTGGGGGCTTGTGGCTTCCCGTGAGCTTCCCCCTAGTCTGTTGTCTG 

GCTGAGCAAGCCTCCTGAGGGGCTCTCTATTG„ 



Fig. 11W 



Truncated protein 1 (ver. 2) 



ATGCC6CGCGCTCCCCGCTGCCGAGCCGT6CGCTCCCTGCTGCGCAGCCACTACCGCGA6GTGCTGCCGCTGGCCAC6TTCGTG 
H P R A P R C R A V R S L I R S H Y R E V L P L A T f V 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P 0 G H R L V Q R G D P A A F R A L ¥ A Q C L V C V P H 0 A R P P P A A 



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
GLPGVGVRLGIRAAGGNQRHAESSAGOSGRFPRR 
A S P G S A S G V G * G R P G G T S D H R R A A Q A T Q G A S P A G 
PPRGRRPAGVEGGRGEPATCGEQRRRLRAIPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V I Q R L C E R G A K N V L A F G F A L L 0 G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S y L P N T V T D A L R G S G A H G L L L R R V G 0 0 V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L I A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
HASGPRRRLGCERAWNHSVREAGVPLGLPAPGARRRGGSA 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRIRGPSOR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPHDTPCPPVYAETKHFLYSSGOKEQLRPSFLLS 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSITGARRLVETIFLGSRPWHPGTPRRLPRLPQRYWQ 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T 0 P R R L V Q L L R Q H S S P W Q V Y G F V R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
t-RRLVPPGLWGSRHNERRFLRNTKKFiSLGKHAKLSlQEL 



Fig. 11X 



GACGTGGAAGATGAGCGTGCG6GACTGC6CTTGGCTGC6CA6GA6CCCAGG6GTTG6CTGT6TTCCG6CCGCAGAGCACC6TCTGC6T6A6GAGATCCTG6CCAAGTTCCTGCACTGGCT 
TWKMSVROCAHLRRSPGVGCVPAAEHRLREE I LAKFLHWL 

GATGAGTGTGTACGTCGTCGAGCTGCTCAG6TCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAG6CTCTTTTTCTACCGGAA6AGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELIRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG 

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
lRQHIKRVQIRELSEAEVRQHREARPAlLISRLRFIPKPD 



GTGGCTGTGCTTTGGTTTAACTTCCTTTTTAACCAGAA 
V A V L W F T F I F N Q K 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P 1 V N H 0 y V V G A R T F R R E K R P S V S F R G * 



Fig. 11 Y 



Truncated protein 2 (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCT6CT6CGCAGCCACTACCGCGA66TGCTGCC6CTGGCCACGTTCGT6 
MPRAPRCRAVRSLIRSHYREVLPLATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G H R L V Q R G 0 P A A f R A L V A Q C L V C V P U 0 A R P P P A A 



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G L P G V G V R L G L R A A G G N Q R H A E S S A G D S G R F P R R 
A S P G S A S G W G * G R P G G T S D N R R A A Q A T Q G A S P A G 
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C I K E L V A R V L Q R L C E R G A K H V L A F G F A L L 0 G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T 0 A L R G S G A W G L L L R R V G 0 D V 

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A I F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRIRGPSDR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R 0 H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W 0 T P C P P V Y A E T K H F I Y S S G 0 K E Q L R P S F L L S 

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R P S L T G A R R L V E T ] f I G S R P W H P G T P R R L P R L P 0 R Y W Q 

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
H R P L F L E L L G N H A Q C P Y G V L L I T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E 0 T 0 P R R L V Q L L R Q H S S P W Q V Y G f V R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
L R R L V P P G L H G S R H N E R R F L R N T K K F I S L G K H A K L S L Q E L 



Fig. 11Z 



